Skip to content

Implemented a deep deterministic policy gradient with a neural network for the OpenAI gym pendulum environment.

Notifications You must be signed in to change notification settings

nrod80/ddpg-for-openai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DDPG for OpenAI Gym Environments

In this casual four day hackathon:

  • As a warm up, I created an extremely simple learning algorithm for the cartpole environment.

  • I implemented a deep deterministic policy gradient with a neural network for the OpenAI gym pendulum environment. read more about DDPG https://arxiv.org/pdf/1509.02971v5.pdf

  • I (fruitlessly) attempted to extend the DDPG to the humanoid environment. (the result is quite funny if you run it).

This project marks my first experience with TensorFlow, though I had previously implemented neural networks in other languages.

About

Implemented a deep deterministic policy gradient with a neural network for the OpenAI gym pendulum environment.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages