Reinforcement Learning Experiments

This repo is just me playing around with reinforcement learning, running simulations. Most are based on "Reinforcement Learning - An Introduction" by Sutton and Barto.

It currently has a Monte Carlo black jack simulation from chapter 5, and an N Bandits simulation using different agent algorithms from chapter 2.

To set up the environment, run:

conda env create
source activate reinf

Tetris - Deep Reinforcement Learning

Use a deep reinforcement learning algorithm to learn a tetris playing agent.

Takes advantage of a Kears/Theano/Cuda stack to learn a neural net Q-function.

(In Progress)

python src/tetris/simulation.py

To run the debugging web app, install socat and websocketd, then run:

websocketd --port=8080 ./socket_server.sh

To use SSH tunnel to watch the learning process on a different server:

ssh -L 8082:localhost:8080 <user@server;> -p 24

Monte Carlo Blackjack (Chapter 5)

Uses e-greedy methods to find an optimal action value function and policy.

python src/monte_carlo_blackjack.py

The following was an attempt to recreate fig 5.5 in Sutton and Barto, using a Monte Carlo, "on-policy", e-greedy model. There seem to be a few minor differences I haven't worked out yet.

N-Bandits (Chapter 2)

Simulate the static n-bandits problem with e-greedy, e-greedy softmax, pursuit, and reinforcement comparison agents. All settings are in the settings/*.yml files, and should be self-evident

python src/runner.py simulations/<your_simulation>.yml

Name		Name	Last commit message	Last commit date
Latest commit History 180 Commits
.idea		.idea
data		data
hosting		hosting
images		images
simulations		simulations
src		src
visualization		visualization
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
board_predictions.png		board_predictions.png
captains.log		captains.log
environment.yml		environment.yml
modelhistory.txt		modelhistory.txt
simple_strategies_scores.txt		simple_strategies_scores.txt
socket_server.sh		socket_server.sh
tetris_agent.py		tetris_agent.py

License

andreweskeclarke/reinforcement_learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Experiments

Tetris - Deep Reinforcement Learning

Monte Carlo Blackjack (Chapter 5)

N-Bandits (Chapter 2)

About

Resources

License

Stars

Watchers

Forks

Languages