Reinforcement learning algorithm implementations
TODO: add NoisyNet learning curve
Currently implemented:
- Vanilla DQN [1], [2]
- Async DQN (multi-process DQN, following the architecture described in [7])
- Double DQN [3]
- Dueling DQN [4]
- Multi-step Q-learning DQN [5]
- NoisyNet [6]
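To make the differences among these variants concrete, here is a rough NumPy sketch of the vanilla, Double, and multi-step Q-learning targets. This is illustrative only; the function and variable names are not this repo's API:

```python
import numpy as np

def dqn_target(r, q_next_target, gamma, done):
    # Vanilla DQN [1][2]: bootstrap from the target network's max Q-value.
    return r + gamma * (1.0 - done) * q_next_target.max(axis=1)

def double_dqn_target(r, q_next_online, q_next_target, gamma, done):
    # Double DQN [3]: the online network selects the action,
    # the target network evaluates it, reducing overestimation.
    a = q_next_online.argmax(axis=1)
    return r + gamma * (1.0 - done) * q_next_target[np.arange(len(a)), a]

def multistep_target(rewards, q_boot, gamma, done):
    # Multi-step (n-step) target [5]: sum n discounted rewards,
    # then bootstrap from the value estimate n steps ahead.
    # rewards: shape (batch, n); q_boot: bootstrap value after n steps.
    n = rewards.shape[1]
    g = (rewards * gamma ** np.arange(n)).sum(axis=1)
    return g + (gamma ** n) * (1.0 - done) * q_boot
```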
- This script was tested on Ubuntu 16 (with an NVIDIA Tesla K80) on Google Cloud Platform.
`pyenv` ([pyenv/pyenv-installer](https://github.com/pyenv/pyenv-installer) installs `pyenv` and friends) is recommended for building a Python environment. `atari-py` requires `cmake`, `zlib`, etc. Install them first (e.g. `apt-get install make cmake zlib1g-dev g++`).
See: Installation Guide — CuPy 4.3.0 documentation
- Install CUDA on your host.
- If you use a [cupy-recommended environment](https://docs-cupy.chainer.org/en/stable/install.html#recommended-environments), the cuDNN and NCCL libraries are included in the `cupy` wheels.

```
$ pip install cupy-cuda92
```
```
$ python train.py myrl/configs/vanilla_dqn.toml PongNoFrameskip-v4
```
- For more detail, see `python train.py --help`.
- [1] Mnih, V., K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. NIPS.
- [2] Mnih, V., K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis. 2015. Human-level control through deep reinforcement learning. Nature.
- [3] van Hasselt, H., A. Guez, and D. Silver. 2016. Deep Reinforcement Learning with Double Q-learning. AAAI.
- [4] Wang, Z., T. Schaul, M. Hessel, H. van Hasselt, M. Lanctot, and N. de Freitas. 2016. Dueling Network Architectures for Deep Reinforcement Learning. ICML.
- [5] Sutton, R. S. 1988. Learning to Predict by the Method of Temporal Differences. Machine Learning.
- [6] Fortunato, M., M. G. Azar, B. Piot, J. Menick, I. Osband, A. Graves, V. Mnih, R. Munos, D. Hassabis, O. Pietquin, C. Blundell, and S. Legg. 2018. Noisy Networks for Exploration. ICLR.
- [7] Horgan, D., J. Quan, D. Budden, G. Barth-Maron, M. Hessel, H. van Hasselt, and D. Silver. 2018. Distributed Prioritized Experience Replay. ICLR.