Hoppingbot

Implementation of Guided Policy Search (GPS) on Hopping Bot.

Unstable

This repo has been made public for educational purposes, and contributions are welcome. I have stopped working on it, but if someone wants to take it up and collaborate with me, you are more than welcome. Below I have provided a glimpse of the TODO tasks.

It goes without saying that I would like to thank Chelsea Finn (professor at Stanford), S. Levine (professor at UC Berkeley), and the many others who contributed to developing GPS, for open-sourcing their code and making their work on GPS public. I am grateful for their generous contribution to the field of Reinforcement Learning.

This code is unstable and needs a lot of polishing. Many parts of the code are still under development. Currently, I am working out the required math and re-deriving the important GPS equations (please refer to S. Levine's thesis on learning motor skills).

Cite

The following must be cited for their online/open-source work: iLQR, GPS, OpenAI. Also cite commercial software such as MUJOCO.

Most of the code was written by taking inspiration from the original publishers, but I have added my own flavour: I trimmed code for our purposes and added other functionality. Please cite me if you use this repository: Author: Sameer Kumar; Date: May 17th 2019; Title: GPS on Hopping task; Designation and School: PhD student at Texas A&M. The date corresponds to when I made this code public. For contact information, refer to my website.

How to Install

Update: sudo apt-get update
Jupyter Notebook: Run python3 -m pip install --upgrade pip and then python3 -m pip install jupyter
Required Dependencies: Run pip3 install -r requirement.txt
Scikit-Learn: pip3 install -U scikit-learn
iLQR module: Go to ilqr-master_new and run python3 setup.py install
GPU Drivers: To install the Nvidia drivers needed to run on the GPU, follow the instructions in the following link. Check that the drivers are installed and responding by running nvidia-smi. This is the hardest part of the installation and may cause a lot of trouble, so be patient; once it is done you are all set. If you can't install tensorflow-gpu, just use tensorflow (the CPU version); it should be fine. To do so, remove the tensorflow-gpu package that requirement.txt installed by running pip3 uninstall tensorflow-gpu, then run pip3 install tensorflow.
MUJOCO: To install, go to the following link. There you can get the student-license version of the software. This may take time, so please be patient, but it should be easier than the GPU driver step.
Update and reboot (not necessary but recommended): sudo apt-get update && sudo reboot.
Finally run TrajGenerator-V2.ipynb
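
Before opening the notebook, it can help to confirm that the steps above actually took effect. The following is a minimal sketch using only the Python standard library. The import names in MODULES (in particular ilqr and mujoco_py) are assumptions about what the install steps provide, not names taken from this repository, so adjust them to match your environment.

```python
import importlib.util
import shutil

# Import names are assumptions; adjust them to match what requirement.txt
# and the iLQR / MUJOCO install steps actually provide on your machine.
MODULES = ["jupyter", "sklearn", "tensorflow", "ilqr", "mujoco_py"]


def check_environment(modules=MODULES):
    """Return a dict mapping each check name to True (found) or False (missing)."""
    report = {}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located;
        # it does not import the module, so this stays fast.
        report[name] = importlib.util.find_spec(name) is not None
    # Having nvidia-smi on PATH suggests the Nvidia driver tools are installed.
    # It does not prove that TensorFlow can use the GPU.
    report["nvidia-smi"] = shutil.which("nvidia-smi") is not None
    return report


if __name__ == "__main__":
    for check, ok in check_environment().items():
        print(f"{check:12s} {'found' if ok else 'MISSING'}")
```

A missing nvidia-smi entry is not fatal: as noted in the GPU Drivers step, the CPU version of tensorflow should be enough.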

Information regarding Hopper:

Bodies: Torso, Thigh, Leg, Foot (kinematic chain in order as per xml file)
Torso: X: Slider, Y: Hinge, Z: Slider, i.e., we have only linear movement in the X and Z directions and rotation about the Y axis.
Thigh: Hinge joint about the Y axis. Angle limit [-150, 0] degrees, Friction 0.9.
Leg: Hinge joint about the Y axis. Angle limit [-150, 0] degrees, Friction 0.9.
Foot: Hinge joint about the Y axis. Angle limit [-150, 0] degrees, Friction 2.0.
States: X = [ZPos, XPos, YPos, YDeg, YDeg, YDeg] in the order of kinematic links. Velocities will also be in the same order.
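
The state layout above can be made concrete with a small helper. This is only a sketch under stated assumptions: the labels (torso_z, thigh_y, and so on) are hypothetical names, the six velocities are assumed to be stacked directly after the six positions, and hinge angles are assumed to be stored in radians even though the limits are listed in degrees. None of these are confirmed by the repository code.

```python
import math

# Hypothetical labels; the order follows the state description above
# (torso Z, torso X, torso Y, then the thigh, leg and foot hinges).
POSITION_LABELS = ["torso_z", "torso_x", "torso_y", "thigh_y", "leg_y", "foot_y"]

# Hinge limits in degrees, as listed for the thigh, leg and foot.
HINGE_LIMITS_DEG = {
    "thigh_y": (-150.0, 0.0),
    "leg_y": (-150.0, 0.0),
    "foot_y": (-150.0, 0.0),
}


def split_state(x):
    """Split a 12-element state into labelled positions and velocities.

    Assumes the layout [6 positions, 6 velocities], both in the same order.
    """
    n = len(POSITION_LABELS)
    if len(x) != 2 * n:
        raise ValueError(f"expected {2 * n} state entries, got {len(x)}")
    positions = dict(zip(POSITION_LABELS, x[:n]))
    velocities = dict(zip(POSITION_LABELS, x[n:]))
    return positions, velocities


def limit_violations(positions):
    """Return labels of hinge joints whose angle (assumed radians) is out of range."""
    bad = []
    for label, (lo_deg, hi_deg) in HINGE_LIMITS_DEG.items():
        if not math.radians(lo_deg) <= positions[label] <= math.radians(hi_deg):
            bad.append(label)
    return bad
```

For example, split_state([1.0, 0.0, 0.0, -0.5, 0.1, -3.0] + [0.0] * 6) labels the first entry as torso_z, and limit_violations on the resulting positions flags leg_y (above 0) and foot_y (below -150 degrees, about -2.618 rad).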

