Clear, concise implementations of state-of-the-art reinforcement learning algorithms in TensorFlow 2:
- Vanilla Policy Gradient
- Trust Region Policy Optimization
- Proximal Policy Optimization
- Deep Deterministic Policy Gradient
- Twin Delayed DDPG
- Soft Actor-Critic