OpenAI Gym experiments

My implementations of normalized advantage functions (NAF) for continuous action spaces and the dueling network architecture (DUEL) for discrete action spaces.
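For orientation, both methods build Q-values from a state value plus an advantage term. The sketch below is a minimal NumPy illustration of the formulations as they are usually stated (mean-subtracted advantages for dueling, a quadratic advantage with P = L L^T for NAF). The function names `dueling_q` and `naf_q` are made up for this example and are not taken from duel.py or naf.py, which may combine the terms differently.

```python
import numpy as np


def dueling_q(value, advantages):
    """Dueling aggregation (mean-subtracted variant): Q = V + A - mean(A)."""
    advantages = np.asarray(advantages, dtype=float)
    return value + advantages - advantages.mean()


def naf_q(value, action, mu, L):
    """NAF: Q(s, a) = V(s) - 0.5 * (a - mu)^T P (a - mu), with P = L L^T."""
    diff = np.asarray(action, dtype=float) - np.asarray(mu, dtype=float)
    L = np.asarray(L, dtype=float)
    P = L.dot(L.T)  # positive semi-definite by construction
    return value - 0.5 * diff.dot(P).dot(diff)


# Illustrative numbers only, not results from this repository.
L_example = [[1.0, 0.0], [0.5, 2.0]]
q_discrete = dueling_q(1.0, [0.5, -0.5, 0.0])
q_at_mu = naf_q(2.0, [0.3, -0.1], [0.3, -0.1], L_example)
q_off_mu = naf_q(2.0, [1.3, -0.1], [0.3, -0.1], L_example)
```

In the NAF form the Q-value is maximized at a = mu, which is why the greedy continuous action can be read directly from the network output.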

Example results with NAF:

Example results with DUEL:

Prerequisites

You will need:

Python 2.7
OpenAI Gym
Keras
Numpy
Scikit-Learn (if using imagination rollouts)

In Ubuntu that would be:

sudo apt-get install python-numpy python-sklearn
pip install --user gym keras

If you want to run Mujoco environments, you also need to acquire a trial key and install the binaries. Then you can install Mujoco support for OpenAI Gym:

pip install --user gym[mujoco]
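After installing, a quick way to confirm the prerequisites are importable is a small check like the one below. This is only a convenience sketch; `check_modules` is a hypothetical helper, not part of this repository.

```python
import importlib


def check_modules(names):
    """Return a dict mapping each module name to True if it imports, else False."""
    status = {}
    for name in names:
        try:
            importlib.import_module(name)
            status[name] = True
        except Exception:
            # Any import-time failure is treated as "not usable".
            status[name] = False
    return status
```

For the packages listed above, the call would be `check_modules(["numpy", "gym", "keras", "sklearn"])`.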

Running the code

There are three main starting points:

python duel.py <envid> - run DUEL against an environment with a discrete action space,
python naf.py <envid> - run NAF against an environment with a continuous action space,
python naf_ir.py <envid> - run NAF with imagination rollouts.

You can override the default hyperparameters with command-line options; use -h to list them or check the code.
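As a rough picture of how such overrides typically work, here is a minimal argparse sketch. The option names `--gamma` and `--batch_size` and their defaults are assumptions chosen for illustration; run the actual scripts with -h for the real options.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(description="Illustrative hyperparameter CLI")
    parser.add_argument("environment", help="Gym environment id, e.g. CartPole-v0")
    # Hypothetical options; the real scripts define their own set.
    parser.add_argument("--gamma", type=float, default=0.99, help="discount factor")
    parser.add_argument("--batch_size", type=int, default=32, help="minibatch size")
    return parser


# Equivalent to: python script.py CartPole-v0 --gamma 0.9
args = build_parser().parse_args(["CartPole-v0", "--gamma", "0.9"])
```

Any option not given on the command line keeps its default, so `args.batch_size` stays at 32 here.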

Some other utility scripts:

python test.py <envid> - test script that runs random actions against the environment,
bash naf_search.sh - example of how to run a crude hyperparameter search for NAF,
bash duel_search.sh - example of how to run a crude hyperparameter search for DUEL.
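Running random actions against an environment can be sketched as follows. This is not the code from test.py; `run_random_episode` is a hypothetical helper, and it assumes the classic Gym interface in which `step()` returns a four-element tuple.

```python
def run_random_episode(env, max_steps=1000):
    """Play one episode with randomly sampled actions and return the total reward."""
    env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = env.action_space.sample()
        # Classic Gym API assumed: (observation, reward, done, info).
        _, reward, done, _ = env.step(action)
        total_reward += reward
        if done:
            break
    return total_reward
```

With Gym installed, this could be called as `run_random_episode(gym.make("CartPole-v0"))`. Newer Gym releases return a five-element tuple from `step()`, so the unpacking would need adjusting there.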

