Description

This is a GAIL baselines what belong to Inverse Reinforcement Learning (IRL) methods.

As we all know GAN and GAIL are fragile, even the baseline code what is written by OpenAI is hard to train. Therefore, I write a GAIL code which is PyTorch edition. Besides, Because of the fragility of GAIL, I add some trick in code, what is inevitable, and the tricks are as flows:

Ｍy_GAIL_PyThorch

Requirements

mujoco-py==2.0.2.13
PyTorch==1.7.1
See more details in requirement.txt

Trick:

Memory: add a replay buffer to train generator,
Batch Normal: using batch normal trick to transform state , action and next state , note: this trick is used for train generator net ，instead of discriminator net.
Reward Function: if generator accuracy less than 0.5, then this indicates that the generator can not identify the generated data and exert data, thus the reward is optimal reward. Conversely the reward equals to reward function generated by discriminator.
Add noise : add noise to discriminator

Note:

The key to train GAIL is that balancing the discriminator and generator performance, a strong discriminator is not allowed, the discriminator should waiting for the generator.

Usage

python main.py  --env_name=Hopper-v2

note: By this way, you can only change the ==environment name==, the other parameters only can be changed in their ==yaml file==, the file path is =="./env_parser/"==.

Runs

Hopper-v2 (expert return = 3500)

HalfCheetah-v2(expert return = 6000)
Ant-v2 (expert return =5500 )
Walker2d-v2 (expert return = 4900)

InvertedPendulum((expert return = 1000)
InvertedDoublePendulum((expert return = 9359)

Generate Expert Demonstrations

This package can be used to generate expert demonstrations.

You can also download expert demonstration via link: Expert Demonstration

Reference

[SAC(pytorch-soft-actor-critic-master)]: https://github.com/pranz24/pytorch-soft-actor-critic

The websites of Four GAIL editions are as flows:

[gail-pytorch]:https://github.com/hcnoh/gail-pytorch.git

[PyTorch-RL]:https://github.com/Khrylx/PyTorch-RL.git

[imitation]:https://github.com/openai/imitation.git

[GAIL]:https://github.com/JiangengDong/GAIL.git

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
GAIL		GAIL
GAIL_pytorch-master		GAIL_pytorch-master
GenerateExpertDemonstration		GenerateExpertDemonstration
My_GAIL_Pytorch		My_GAIL_Pytorch
PyTorch-RL		PyTorch-RL
README.assets		README.assets
gail-pytorch		gail-pytorch
imitation		imitation
pytorch-soft-actor-critic-master		pytorch-soft-actor-critic-master
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Johnny-Zhang92/IRL-Essential-Code

Folders and files

Latest commit

History

Repository files navigation

Description

Ｍy_GAIL_PyThorch

Requirements

Trick:

Note:

Usage

Runs

Generate Expert Demonstrations

Reference

About

Topics

Resources

Stars

Watchers

Forks

Languages