My notes on reinforcement learning.
Update: I am implementing some new algorithms in private repos, so the list here is incomplete. I will come back to update this from time to time.
- C51, distributional Q-learning
- Solve Montezuma with re-weighted sampling
- Move PPO into this repo
- DQN
- prioritized replay
- double Q-learning (or half Q-learning)
- dueling networks
- $\epsilon$-greedy with linear scheduling
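A minimal sketch of linearly scheduled $\epsilon$-greedy (the schedule endpoints and function names here are illustrative, not from any particular implementation):

```python
import random

def linear_epsilon(step, start=1.0, end=0.1, anneal_steps=10_000):
    """Linearly anneal epsilon from `start` to `end` over `anneal_steps`, then hold."""
    frac = min(step / anneal_steps, 1.0)
    return start + frac * (end - start)

def epsilon_greedy(q_values, step):
    """With probability epsilon take a random action, otherwise the greedy one."""
    if random.random() < linear_epsilon(step):
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```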
- Gradients and the REINFORCE algorithm
- policy gradients
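As a sanity check on the score-function gradient, here is REINFORCE on a toy two-armed bandit with a softmax policy and a running-average baseline (everything here is an illustrative sketch, not tied to any repo code):

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(logits):
    z = np.exp(logits - logits.max())
    return z / z.sum()

def pull(arm):
    # Toy bandit: arm 1 always pays +1, arm 0 pays 0.
    return float(arm == 1)

theta = np.zeros(2)   # one logit per arm
baseline = 0.0        # running-average reward, used to reduce variance
for step in range(2000):
    probs = softmax(theta)
    a = rng.choice(2, p=probs)
    r = pull(a)
    # For a softmax policy, grad_theta log pi(a) = one_hot(a) - probs.
    grad_log_pi = -probs
    grad_log_pi[a] += 1.0
    theta += 0.1 * (r - baseline) * grad_log_pi
    baseline += 0.05 * (r - baseline)
```

After training, the policy should strongly prefer the paying arm.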
- Setups
- Get MuJoCo
- set up OpenAI Gym on AWS (yay! :confetti_ball:)
- install MuJoCo
- install mujoco-py (needs upgrading to 1.50, which now supports Python 3.6)
- make a list of concepts to keep track of
- TRPO
- A3C
- Behavior Cloning
- DAgger
I found the textbook to be the most reliable source, but it's easy to get lost in the chapters. So the best way to ask for guidance seems to be:
"I'm reading Chapter xx and topic xx atm, what are the key things I should pay attention to?"
- David Silver's RL course index
- Berkeley RL course http://rll.berkeley.edu/deeprlcourse/
- http://blog.shakirm.com/2015/11/machine-learning-trick-of-the-day-5-log-derivative-trick/
- https://arxiv.org/pdf/1506.05254.pdf gives a longer explanation of different viewpoints on taking derivatives.
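The log-derivative trick itself is one line: since $\nabla_\theta p_\theta(x) = p_\theta(x)\,\nabla_\theta \log p_\theta(x)$,

$$\nabla_\theta \mathbb{E}_{x \sim p_\theta}[f(x)] = \int f(x)\,\nabla_\theta p_\theta(x)\,dx = \mathbb{E}_{x \sim p_\theta}\!\left[f(x)\,\nabla_\theta \log p_\theta(x)\right],$$

which is exactly the expectation that REINFORCE estimates by sampling.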
- Contextual bandits:
- Curiosity as reward
- Finding answers as reward
- inferring intention
- Learning to predict (lots of prior art. self-supervision)
- Auxiliary supervision and auxiliary modalities
- inverse reinforcement learning != imitation learning