
deep_rl_ale

This repo contains a TensorFlow implementation of the DeepMind Nature DQN paper. It also includes the option to use the double DQN loss function, as well as a parallel version that acts and learns simultaneously to speed up training.
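
For illustration only, here is a minimal NumPy sketch of the double DQN target (the function and array names are hypothetical; the repo builds its actual loss in TensorFlow). The online network picks the next action and the target network evaluates it:

    import numpy as np

    def double_dqn_targets(rewards, terminals, q_online_next, q_target_next, gamma=0.99):
        # q_online_next / q_target_next: (batch, num_actions) Q-values for the next states
        best_actions = np.argmax(q_online_next, axis=1)  # action choice from the online net
        next_values = q_target_next[np.arange(len(best_actions)), best_actions]
        # Plain DQN would use np.max(q_target_next, axis=1) here instead.
        return rewards + gamma * (1.0 - terminals) * next_values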

Watch it play Pong, Breakout, Space Invaders, and Seaquest here

The code is still a little messy in some places, and will be cleaned up in the future, but there will probably not be any significant updates or changes until mid-May.

Dependencies/Requirements

  1. An nVidia GPU with GDDR5 memory to train in a reasonable amount of time
  2. Python 3
  3. The Arcade Learning Environment for the emulator framework.
  4. TensorFlow for GPU numerical computations and symbolic differentiation.
  5. Linux/OSX, because TensorFlow doesn't support Windows.
  6. Matplotlib and Seaborn for visualizations.
  7. OpenCV for image scaling (a rough sketch of this kind of preprocessing follows this list). Might switch to SciPy since OpenCV was a pain for me to install.
  8. Any dependencies of the above software, of course, like NumPy.
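
As a minimal sketch of the kind of frame scaling OpenCV is used for here (84x84 grayscale as in the DeepMind paper is assumed; the repo's actual preprocessing may differ):

    import cv2

    def preprocess_frame(rgb_frame, size=(84, 84)):
        # Convert an emulator RGB frame to a small grayscale image.
        gray = cv2.cvtColor(rgb_frame, cv2.COLOR_RGB2GRAY)
        return cv2.resize(gray, size, interpolation=cv2.INTER_AREA)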

How to run

From the top directory of the repo (dir with python files):

Training

$ python3 ./run_dqn.py <name_of_game> <name_of_algorithm/method> <name_of_agent_instance>

For example:

$ python3 ./run_dqn.py breakout dqn brick_hunter

Watching

$ python3 ./run_dqn.py <name_of_game> <name_of_algorithm/method> <name_of_saved_model> --watch

Where <name_of_saved_model> is the <name_of_agent_instance> used during training. If you used any non-default settings, make sure to use the same ones when watching as well.
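
For example, to watch the agent trained in the example above:

$ python3 ./run_dqn.py breakout dqn brick_hunter --watch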

Running Notes

You can change many hyperparameters/settings by entering optional arguments. To get a list of arguments:

$ python3 ./run_dqn.py -h
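
For reference, a hypothetical sketch of how the positional arguments and the --watch flag shown above might be declared with argparse (the real argument names, defaults, and option list are defined in run_dqn.py):

    import argparse

    # Hypothetical sketch; the actual parser lives in run_dqn.py.
    parser = argparse.ArgumentParser(description='Train or watch a DQN agent on an Atari game.')
    parser.add_argument('game')          # e.g. breakout
    parser.add_argument('method')        # e.g. dqn
    parser.add_argument('agent_name')    # agent instance / saved-model name
    parser.add_argument('--watch', action='store_true',
                        help='load the saved model and watch it play instead of training')
    args = parser.parse_args()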

By default, ROM files are expected to be in a folder titled 'roms' in the parent directory of the repo. You can pass a different directory as an argument or change the default in run_dqn.py.

Statistics and saved models are saved in the parent directory of the repo as well.

The default settings are very similar to those used in the DeepMind Nature paper. There are only a few small differences of which I am aware.

A full training run takes between 3 and 4 days on my nVidia GTX 970, depending on whether or not the parallel option is used. The parallel option speeds up training by ~30%, but I'm still testing how different settings impact speed.
