No longer maintained

This fork is no longer used for development, please go to https://github.com/watchernyu/spinningup-drl-prototyping instead. Thanks!

Soft Actor-Critic Pytorch Implementation

Soft Actor-Critic Pytorch Implementation, based on the OpenAI Spinup documentation and some of its code base. This is a minimal, easy-to-learn and well-commented Pytorch implementation, and recommended to be studied along with the OpenAI Spinup Doc. This SAC implementation is based on the OpenAI spinningup repo, and uses spinup as a dependency. Target audience of this repo is Pytorch users (especially NYU students) who are learning Soft Actor-Critic algorithm.

Setup environment:

To use the code you should first download this repo, and then install spinup:

the spinup documentation is here, you should read it to make sure you know the procedure: https://spinningup.openai.com/en/latest/user/installation.html

The only difference in installation is you want to install this forked repo, instead of the original repo, so when you are ready to install this in a virtualenv you should run the following commands instead:

git clone https://github.com/watchernyu/spinningup.git
cd spinningup
pip install -e .

The Pytorch version used is: 0.4.1, install pytorch: https://pytorch.org/

If you want to run Mujoco environments, you need to also install Mujoco and get a liscence. For how to install and run Mujoco on NYU's hpc cluster, check out my other tutorial: https://github.com/watchernyu/hpc_setup

Run experiment

The SAC implementation can be found under spinup/algos/sac_pytorch/

Run experiments with pytorch sac:

In the sac_pytorch folder, run the SAC code with python sac_pytorch

Or you can use a spinup experiment grid: a sample grid is given under spinningup/experiments/, you can run it with python sample_grid.py

Note: currently there is no parallel running for SAC (also not supported by spinup), so you should always set number of cpu to 1 when you use experiment grid.

The program structure, though in Pytorch has been made to be as close to spinup tensorflow code as possible so readers who are familiar with other algorithm code in spinup will find this one easier to work with. I also referenced rlkit's SAC pytorch implementation, especially for the policy and value models part, but did a lot of simplification.

Consult Spinup documentation for output and plotting:

https://spinningup.openai.com/en/latest/user/saving_and_loading.html

https://spinningup.openai.com/en/latest/user/plotting.html

Reference:

Original SAC paper: https://arxiv.org/abs/1801.01290

OpenAI Spinup docs on SAC: https://spinningup.openai.com/en/latest/algorithms/sac.html

rlkit sac implementation: https://github.com/vitchyr/rlkit

Acknowledgement

Great thanks to Josh Achiam, the author of OpenAI Spinning Up. I think the Spinning Up documentation/code is an incredibly good resource for learning DRL and it made my learning much more effective. And also huge thanks for helping me with some Spinup coding issues!

Below are original Spinning Up readme

==================================

Status: Active (under active development, breaking changes may occur)

Welcome to Spinning Up in Deep RL!

This is an educational resource produced by OpenAI that makes it easier to learn about deep reinforcement learning (deep RL).

For the unfamiliar: reinforcement learning (RL) is a machine learning approach for teaching agents how to solve tasks by trial and error. Deep RL refers to the combination of RL with deep learning.

This module contains a variety of helpful resources, including:

a short introduction to RL terminology, kinds of algorithms, and basic theory,
an essay about how to grow into an RL research role,
a curated list of important papers organized by topic,
a well-documented code repo of short, standalone implementations of key algorithms,
and a few exercises to serve as warm-ups.

Get started at spinningup.openai.com!

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
docs		docs
experiments		experiments
spinup		spinup
test		test
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
readme.md		readme.md
readthedocs.yml		readthedocs.yml
setup.py		setup.py
travis_setup.sh		travis_setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs

docs

experiments

experiments

spinup

spinup

test

test

.gitignore

.gitignore

.travis.yml

.travis.yml

LICENSE

LICENSE

readme.md

readme.md

readthedocs.yml

readthedocs.yml

setup.py

setup.py

travis_setup.sh

travis_setup.sh

Repository files navigation

No longer maintained

Soft Actor-Critic Pytorch Implementation

Setup environment:

Run experiment

Reference:

Acknowledgement

Welcome to Spinning Up in Deep RL!

About

Releases

Packages

Languages

License

watchernyu/spinningup

Folders and files

Latest commit

History

Repository files navigation

No longer maintained

Soft Actor-Critic Pytorch Implementation

Setup environment:

Run experiment

Reference:

Acknowledgement

Welcome to Spinning Up in Deep RL!

About

Resources

License

Stars

Watchers

Forks

Languages