Skip to content

sts-sadr/reinforcement-learning-experiments

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

99 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement-Learning-An-Introduction

Implementations to supplement my reading of "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew Barto.

Ch 2 - Multi-armed Bandits

Ch 4 - Dynamic Programming

Ch 5 - Monte Carlo Methods

Ch 6 - Temporal-Difference Learning

Ch 7 - Multi-step Bootstrapping

Ch 8 - Planning and Learning with Tabular Methods

dqn - My own experiments in using q-networks to achieve generalization in more complex environments

environments - A collection of environment (fMDP) implementations. These environments extend OpenAI Gym's Environment class.

About

Implementations to supplement my reading of "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew Barto.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%