PyTorch implementation of GAE and GAIL with PPO

This repository contains a Pytorch implementation of Generalized Advantage Estimation (GAE) and Generative Adversarial Imitation Learning (GAIL) with Proximal Policy Optimization (PPO)

Usage

For GAE, use

python gae.py --env-name Hopper-v1

For GAIL, use

python gail.py --env-name Hopper-v1 --expert-path hopper_expert_trajectories/ --batch-size 20000 --num-expert-trajs 10 --optim-epochs 5 --num-episodes 2000

For GAIL with Phase MLP architecture, use

python phase_gail.py --env-name Hopper-v1 --expert-path hopper_expert_trajectories/ --batch-size 20000 --num-expert-trajs 10 --optim-epochs 5 --num-episodes 2000

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
hopper_expert_trajectories		hopper_expert_trajectories
README.md		README.md
gae.py		gae.py
gail.py		gail.py
gru.py		gru.py
load_expert_traj.py		load_expert_traj.py
main.py		main.py
models.py		models.py
phase_gail.py		phase_gail.py
phase_mlp.py		phase_mlp.py
phase_mlp_multilayer_new_fast.py		phase_mlp_multilayer_new_fast.py
replay_memory.py		replay_memory.py
rnn_gae.py		rnn_gae.py
rnn_gail.py		rnn_gail.py
running_state.py		running_state.py
test_gail.py		test_gail.py
utils.py		utils.py

sharma-arjun/GAIL

Folders and files

Latest commit

History

Repository files navigation

PyTorch implementation of GAE and GAIL with PPO

Usage

About

Resources

Stars

Watchers

Forks

Languages