2048-rl

Deep Q-Learning Project to play 2048. See this presentation for an introduction.

Getting Started

Install TensorFlow, python & pip. Then, run:

pip install -r requirements.txt

To run the code, you'll need to update your PYTHONPATH:

source set_pythonpath.sh

Now, you should be able to run the tests:

py.test

Source Code Structure

All python source code lives in py_2048_rl.

game

This directory contains code to simulate the 2048 game itself. For example, it provides a Game class that implements the game logic. The play module defines the Experience class, a play() function and various strategies that can be passed as an argument to play().

learning

This directory contains all code that has to do with the Deep Q-Learning algorithm itself. Here's a comprehensive list of the modules:

replay_memory implements the Replay Memory. Main methods are add() to add an experience and sample() to sample a number of experiences.
experience_collector implements a collect(strategy, num_games) function that plays a number of games, deduplicates & undersamples the experiences, and returns them.
target_batch_computer is responsible for computing the target batch that is passed to the network.
experience_batcher uses the ReplayMemory, ExperienceCollector and TargetBatchComputer to generate training batches for the neural network.
model defines the Neural Network architecture and its training parameters (e.g. Learning Rate).
learning glues everything together to implement the Deep Q-Learning algorithm.

Run Training

Step 1 is to set various parameters.

For example, you might want to adjust

The GAMMA value in target_value_computer.py
The INIT_LEARNING_RATE or HIDDEN_SIZES in model.py
The MIN_EPSILON in experience_batcher.py
...

Once that's done, you can simple run python py_2048_rl/learning/learning.py <train_dir>.

Analyzing the Model

You can use TensorBoard to monitor your Network training, simply by passing you train directory as the --logdir param. Furthermore, have a look at py_2048_rl/analisis.py (for plotting a historgram of Q-Values) and py_2048_rl/play_game.py (for simulating a (number of) games given a particular model).

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
experiments		experiments
py_2048_rl		py_2048_rl
.editorconfig		.editorconfig
.gitignore		.gitignore
README.md		README.md
pylintrc		pylintrc
requirements.txt		requirements.txt
set_pythonpath.sh		set_pythonpath.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiments

experiments

py_2048_rl

py_2048_rl

.editorconfig

.editorconfig

.gitignore

.gitignore

README.md

README.md

pylintrc

pylintrc

requirements.txt

requirements.txt

set_pythonpath.sh

set_pythonpath.sh

Repository files navigation

2048-rl

Getting Started

Source Code Structure

game

learning

Run Training

Analyzing the Model

About

Releases

Packages

Languages

tjussh/2048-rl

Folders and files

Latest commit

History

Repository files navigation

2048-rl

Getting Started

Source Code Structure

game

learning

Run Training

Analyzing the Model

About

Resources

Stars

Watchers

Forks

Languages