deepatari

_ _

Framework for developing and comparing reinforcement learning algorithms in the Atari game playing domain

This is a framework to support the development of algorithms for intelligent agents in the Atari game playing domain. The idea is to let researcher or programmer focus on the development of algorithms without having to think about the whole environment implementation and to facilitate and generalize the evaluation of these algorithms to make results comparable.

A big advantage is that you can simply write your own implementation using whatever library you prefer and it should seamlessly work with the framework. The following image shows how the framework is organized. So far you should be able to add another learner class by copying the skeleton for a new learner class from './deepatari/learner/skeleton.py' and run an experiment using the '--with name_of_your_learner' argument.

Based on:

Python
ALE (http://www.arcadelearningenvironment.org/)
OpenAI gym (https://openai.com/blog/openai-gym-beta/)

Tested on CentOS release 6.6 with Python 2.7.11

README contents

Installation

This guide will help install all the necessary software in a virtual environment generated with Anaconda. It is a step by step guide which might not be necessary for everyone. If you want to integrate the dependencies into your existing installation please do so.

Please be aware that I do not cover the installation of the Nvidia CUDA and CuDNN driver, please follow these instructions:

http://docs.nvidia.com/cuda/cuda-getting-started-guide-for-linux/

Create a working directory

Create a folder, where you download all necessary stuff and keep the rest of your system clean:

mkdir deepatari_stuff && cd deepatari_stuff

Prepare conda environment

If you haven't done it already, install anaconda with instructions from here:

http://conda.pydata.org/docs/installation.html

Create a conda environment with the basic packages and activate it:

conda create --name deepatariEnv python=2.7 pip numpy opencv matplotlib
source activate deepatariEnv

Add non standard packages

The rest of the packages should be installed in this order, because they downgrade certain packages which will be upgraded later on again.

Download and install Neon [3]:

git clone https://github.com/NervanaSystems/neon.git
cd neon && make sysinstall && cd ..

Download and install OpenAI gym [4]:

git clone https://github.com/openai/gym.git
cd gym && pip install -e '.[atari]' && cd ..

Install the latest Theano [6] version :

pip install --upgrade https://github.com/Theano/Theano/archive/master.zip

Install the latest Lasagne [2] for Theano :

pip install --upgrade https://github.com/Lasagne/Lasagne/archive/master.zip

Install Tensorflow (Ubuntu/Linux 64-bit, GPU enabled. Requires CUDA toolkit 7.5 and CuDNN v4. Other options on library page.) [5]:

pip install --upgrade https://storage.googleapis.com/tensorflow/linux/gpu/tensorflow-0.8.0-cp27-none-linux_x86_64.whl

Install Keras [1]:

pip install keras

Install deepatari

The setup routine installs some runnable scripts to use directly from the command line:

git clone git@github.com:cowhi/deepatari.git
cd deepatari && python setup.py install && cd ..

Now you should be able to test the installation.

Running the software

After the installation you are ready to run your experiments.

One way to test the software with the minimal settings is to run the following command in any directory. The program will run and create an individual folder for each experiment as a subfolder of the 'results' directory, which will be made if it does not exist. This example uses the Neon implementation of the original DQN because it supports running on CPU in contrast to the Lasagne implementation. If you have a compatible Nvidia GPU, you can ran any implementation directly on the GPU.

learn_to_play --with DQNNeon --fill_mem_size 128 --train_steps 128 --test_steps 64 --epochs 2 --log_type stdout --log_stats False --backend cpu

Sources & Inspirations

I want to thank the authors of the following packages, framework and algorithms, which served as an inspiration and were the basis for some of the algorithms implementations.

Original code

Provided by Google Deep Mind under

https://sites.google.com/a/deepmind.com/dqn/

based on their paper:

Mnih, Volodymyr, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves et al. "Human-level control through deep reinforcement learning." Nature 518, no. 7540 (2015): 529-533. http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html

Other implementations

The following packages where heavily used for the respective learners:

deep_q_rl: DQN implementation based on Theano and Lasagne

https://github.com/spragunr/deep_q_rl
simple_dqn: DQN implementation based on Neon

https://github.com/tambetm/simple_dqn

The following packages are also interesting:

AgentNet: A lightweight library to build and train deep reinforcement learning

https://github.com/yandexdataschool/AgentNet
sherjilozair/dqn: Basic DQN implementation, which uses OpenAI's gym environment and Keras/Theano

https://github.com/sherjilozair/dqn
DEER: DEEp Reinforcement learning framework (very similar to this package)

https://github.com/VinF/deer

Community

Deep Q-Learning: Discussion group for DQN (mostly deep_q_rl)

https://groups.google.com/forum/#!forum/deep-q-learning

Others

RLPy: Framework for conducting sequential decision making experiments

https://github.com/rlpy/rlpy
PyBrain: Python Machine Learning Library with Reinforcement Learning

https://github.com/pybrain/pybrain/tree/master/pybrain/rl

References

[1] Keras: http://keras.io/

[2] Lasagne: http://lasagne.readthedocs.io/en/latest/

[3] Nervana Neon: http://www.nervanasys.com/technology/neon/

[4] OpenAI: https://openai.com/blog/openai-gym-beta/

[5] Tensorflow: https://www.tensorflow.org/

[6] Theano: http://deeplearning.net/software/theano/

Todo's

Video playback (see: https://github.com/tambetm/simple_dqn)
Record videos (see: https://github.com/openai/gym/blob/master/README.rst#id11)
Prioritized Replay Memory (see: https://github.com/VinF/deer/blob/master/deer/helper/tree.py)
Multi Agent support (see: https://github.com/yandexdataschool/AgentNet)
Double DQN (see https://github.com/VinF/deer/blob/master/deer/q_networks/q_net_keras.py)

Known problems

Program does not run because of a problem with h5py

Error message:

...
File "h5py/h5p.pyx", line 72, in h5py.h5p.propwrap (/tmp/pip-build-5MMDj7/h5py/h5py/h5p.c:2407)
ValueError: Not a property list class (Not a property list class)

Possible fix (install more recent version of h5py from non-standard repo):

conda install -c conda-forge h5py=2.6.0

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
deepatari		deepatari
docs		docs
.gitignore		.gitignore
.travis.yml		.travis.yml
AUTHORS.rst		AUTHORS.rst
CHANGES.rst		CHANGES.rst
LICENSE.txt		LICENSE.txt
README.rst		README.rst
experiment_setup.png		experiment_setup.png
requirements.txt		requirements.txt
requirements.txt_old		requirements.txt_old
setup.cfg		setup.cfg
setup.py		setup.py
test-requirements.txt		test-requirements.txt

License

cowhi/deepatari

Folders and files

Latest commit

History

Repository files navigation