BasicReinforcementLearning

Simple Reinforcement learning example, based on the Q-function.

Rules: The agent (yellow box) has to reach one of the goals to end the game (green or red cell).
Rewards: Each step gives a negative reward of -0.04. The red cell gives a negative reward of -1. The green one gives a positive reward of +1.
States: Each cell is a state the agent can be.
Actions: There are only 4 actions. Up, Down, Right, Left.

The little triangles represent the values of the Q function for each state and each action. Green is positive and red is negative.

Demo (Q-Learning)

http://youtu.be/tiTR8F41_v0

Run

Three different agents are currently implemented.

Q-Learning

Run:

python QLearner.py

Sarsa

Run:

python SarsaLearner.py

Sarsa lambda

Run:

python SarsaLambdaLearner.py

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
NewLearner_Q_with_eligibility_trace.py		NewLearner_Q_with_eligibility_trace.py
QLearner.py		QLearner.py
README.md		README.md
SarsaLambdaLearner.py		SarsaLambdaLearner.py
SarsaLearner.py		SarsaLearner.py
World.py		World.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NewLearner_Q_with_eligibility_trace.py

NewLearner_Q_with_eligibility_trace.py

QLearner.py

QLearner.py

README.md

README.md

SarsaLambdaLearner.py

SarsaLambdaLearner.py

SarsaLearner.py

SarsaLearner.py

World.py

World.py

Repository files navigation

BasicReinforcementLearning

Demo (Q-Learning)

Run

Q-Learning

Sarsa

Sarsa lambda

About

Releases

Packages

Languages

PhilippeMorere/BasicReinforcementLearning

Folders and files

Latest commit

History

Repository files navigation

BasicReinforcementLearning

Demo (Q-Learning)

Run

Q-Learning

Sarsa

Sarsa lambda

About

Resources

Stars

Watchers

Forks

Languages