Skip to content

yuishihara/chainer-td3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

chainer-td3

Reproduction codes of Twin Delayed Deep Deterministic policy gradient (TD3) with chainer

About

This repo is a TD3 reproduction codes writen with chainer. See this original paper for details

How to train

Will be trained with CPU by default

$ python3 main.py --env="walker2d-v2"

Results

May require to export below variable before running the code in linux environment.

$ export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so

Walker2d-v2

$ python3 main.py --test-run --pi-params=trained_results/mujoco/walker2d-v2/pi_final_model
result score
walker2d-v2-result walker2d-v2-graph

Ant-v2

$ python3 main.py --test-run --pi-params=trained_results/mujoco/ant-v2/pi_final_model
result score
ant-v2-result ant-v2-graph

About

Reproduction codes of Twin Delayed Deep Deterministic policy gradient (TD3) with chainer

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages