Policy Gradient with Recurrent Neural Network (RNN)

A modular implementation of the Vanilla Policy Gradient (VPG) algorithm with an RNN policy.
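For orientation, the core of VPG can be sketched in a few lines of NumPy. This is an illustrative sketch, not code from this repository: the function names `discounted_returns` and `vpg_loss` and the default `gamma=0.99` are assumptions made for the example.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    # Walk the episode backwards so each step reuses the return of the next one.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def vpg_loss(log_probs, returns):
    """Surrogate loss: minimizing it follows the policy-gradient estimate.

    log_probs holds log pi(a_t | s_t) for the actions actually taken.
    """
    log_probs = np.asarray(log_probs, dtype=np.float64)
    returns = np.asarray(returns, dtype=np.float64)
    return -np.mean(log_probs * returns)
```

In a TensorFlow implementation the same loss would be built from the policy's log-probability tensor so that the framework can differentiate it; a baseline or return normalization is often added to reduce variance.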

Dependencies

Python 2.7 or 3.5
TensorFlow 1.10
gym
numpy
tqdm progress-bar

Features

An RNN policy that outputs the action probabilities for a reinforcement learning problem
A sampler that reshapes each trajectory so it can be fed into the RNN policy
Gradient clipping to mitigate the exploding gradient problem
A GRU cell to mitigate the vanishing gradient problem
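The four features listed here can be illustrated with a minimal NumPy sketch. It is not the repository's TensorFlow code: every function name, the parameter keys (`Wz`, `Wr`, `Wh`, `Wo`), the bias-free GRU, and the padded `[batch, time, obs_dim]` layout are assumptions chosen for the example. The clipping rule is the global-norm rule, the same scaling that `tf.clip_by_global_norm` applies.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(logits):
    # Subtract the max for numerical stability before exponentiating.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def gru_step(x, h, p):
    """One GRU update (biases omitted for brevity)."""
    xh = np.concatenate([x, h], axis=-1)
    z = sigmoid(xh @ p["Wz"])  # update gate
    r = sigmoid(xh @ p["Wr"])  # reset gate
    cand = np.tanh(np.concatenate([x, r * h], axis=-1) @ p["Wh"])
    # When z is near 0 the old state is carried forward almost unchanged,
    # which is what helps gradients survive over long sequences.
    return (1.0 - z) * h + z * cand


def action_probs(obs_seq, p):
    """Run the GRU over [time, obs_dim]; return [time, n_actions] probabilities."""
    h = np.zeros(p["Wz"].shape[1])
    out = []
    for x in obs_seq:
        h = gru_step(x, h, p)
        out.append(softmax(h @ p["Wo"]))
    return np.stack(out)


def pad_trajectories(trajs, max_len):
    """Stack variable-length [T_i, obs_dim] arrays into [batch, max_len, obs_dim].

    Also returns a [batch, max_len] mask that is 1.0 on real steps, 0.0 on padding.
    """
    obs_dim = trajs[0].shape[1]
    batch = np.zeros((len(trajs), max_len, obs_dim))
    mask = np.zeros((len(trajs), max_len))
    for i, traj in enumerate(trajs):
        n = min(len(traj), max_len)
        batch[i, :n] = traj[:n]
        mask[i, :n] = 1.0
    return batch, mask


def clip_by_global_norm(grads, clip_norm):
    """Rescale all gradients together so their joint L2 norm is at most clip_norm."""
    global_norm = np.sqrt(sum(np.sum(g ** 2) for g in grads))
    scale = clip_norm / max(global_norm, clip_norm)
    return [g * scale for g in grads], global_norm
```

The mask lets the loss ignore padded time steps, and clipping by the joint norm preserves the direction of the overall gradient, which clipping each tensor separately would not.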

Usage

To train a model on CartPole-v0:

$ python run_pg_rnn.py

To view training progress in TensorBoard:

$ tensorboard --logdir .

Results

[Images: TensorBoard, progress bar]
