Skip to content

seanahmad/deep-rl

 
 

Repository files navigation

Deep Reinforcement Learning

Repository implementing deep reinforcement learning and conducting experiments on the OpenAI environments

Setup

# Clone repo
git clone git@github.com:eugeneyan/deep_rl.git && cd deep_rl

# Create conda environment
conda env create -f=environment.yml

# Activate environment
source activate deep_rl

box2D errors

If a box2D error is encountered (e.g., AttributeError: module '_Box2D' has no attribute 'RAND_LIMIT_swigconstant'), please follow the steps below:

pip uninstall Box2D box2d-py
git clone https://github.com/pybox2d/pybox2d
cd pybox2d/
python setup.py clean
python setup.py build
python setup.py install

More details here: openai/gym#100

Environments

Currently, the following environments are supported:

Models

Running experiments

Running experiments is a simple as

# To run experiment on cartpole environment
python cartpole_runner.py

# To run experiment on lunar lander environment 
python lunarlander_runner.py

Usage

python lunarlander_runner.py --help

usage: lunarlander_runner.py [-h] [--model MODEL] [--render-env RENDER_ENV]
                             [--render-freq RENDER_FREQ]

optional arguments:
  -h, --help            show this help message and exit
  --model MODEL         DeepRL model to use (options: dqn_plain, dqn, ddqn;
                        default: ddqn)
  --render-env RENDER_ENV
                        Whether to render the environment (default: y)
  --render-freq RENDER_FREQ
                        How frequently to render the env (default: 500)
                        --render-env must be set to "y" to render environment

Industrial applications of Deep RL

References

  • C. Watkins “Learning from delayed rewards”, PhD Thesis, University of Cambridge, England, 1989

  • V. Mnih, et al. “Human-level control through deep reinforcement learning.” Nature 518(7540):529-533, 2015.

  • V. Minh, et al. “Playing atari with deep reinforcement learning.” CoRR abs/1312.5602, 2013.

  • H. van Hasselt, A. Guez, and D. Silver. “Deep reinforcement learning with double Q-Learning.” CoRR abs/1509.06461, 2015

About

Repository for deep reinforcement learning with OpenAI

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%