Skip to content

edchengg/reinforcement_learning_learning_notes

 
 

Repository files navigation

Reinforcement Learning Notes

My notes on reinforcement learning.

Update: I am implementing some new algorithms in private repos, so the list here is incomplete. I will come back to update this from time to time.

Plans (2017-12-04)

  • C51, distributional Q-learning
  • Solve Montezuma with re-weighted sampling
  • Move PPO into this repo

Done

  • DQN
    • prioritized replay
    • double Q-learning (or half Q-learning)
    • dueling networks
    • $\epsilon$-greedy with linear scheduling
  • Gradients, and REINFORCE algorithm
  • policy gradients
  • Setups
    • Get MuJoCo
    • setup OpenAI Gym on AWS (yay!:confetti_ball:)
    • install MuJoCo 🎊
    • install mujoco-py (need to upgrade to 1.50 now supports python 3.6)
  • make a list of concepts to keep track of

Backlog

  • TRPO
  • A3C
  • Behavior Cloning
  • DAgger

On How to Ask for Help

I found textbook to be the most reliable source but it's easy to get lost in the chapters. So the best way to ask for guidance seem to be:

I'm reading Chapter xx and topic xx atm, what are the key things I should pay attention to?

Reference Readings

Research Ideas

  • Curiosity as reward
  • Finding answers as reward
  • inferring intention
  • Learning to predict (lots of prior art. self-supervision)
  • Auxiliary supervision and Auxiliary modalities.
  • inverse reinforcement learning != imitation learning

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 61.5%
  • Jupyter Notebook 31.9%
  • Makefile 5.0%
  • Shell 1.6%