GitHub - pramodiisc/Stoch3_foot: Stoch3_foot simulation for spring

Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy Approach

Introduction:

This is the main code base accompanying the paper with the above title, under review in CoRL 2020.

Getting Started:

To install the package and its dependencies, run the following command inside the Stoch2_gym_env folder:

    python -m pip install .

The code base was tested with gym (0.17.2) and pybullet (2.8.2) on Python 3.6.9. It is expected to work with later versions of these packages, but they have not been tested.
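
Since only one combination of versions was tested, it can help to compare the local installation against it before running anything. The following is a minimal sketch, not part of the repository: the helper names are hypothetical, and `importlib.metadata` needs Python 3.8 or later, which is newer than the tested interpreter.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions the README reports as tested.
TESTED_VERSIONS = {"gym": "0.17.2", "pybullet": "2.8.2"}


def parse_version(text):
    """Turn '0.17.2' into (0, 17, 2), dropping any non-numeric parts."""
    return tuple(int(part) for part in text.split(".") if part.isdigit())


def compare_to_tested(package, installed):
    """Classify an installed version string relative to the tested one."""
    tested = parse_version(TESTED_VERSIONS[package])
    found = parse_version(installed)
    if found == tested:
        return "tested"
    return "newer, untested" if found > tested else "older, untested"


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def report():
    """Print one status line per package listed in TESTED_VERSIONS."""
    for package, tested in TESTED_VERSIONS.items():
        found = installed_version(package)
        status = "not installed" if found is None else compare_to_tested(package, found)
        print(f"{package}: installed={found}, tested={tested} ({status})")
```

Calling `report()` after installation prints the status of both packages.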

Robots Tested for:

Stoch2:

Orientation\Elevation	-11°	-9°	-7°	-5°	5°	7°	9°	11°
0°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
30°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
60°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
90°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️

HyQ:

Orientation\Elevation	-13°	-11°	-9°	-7°	-5°	5°	7°	9°	11°	13°
0°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
30°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
60°	✔️	❌	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
90°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	❌

Laikago:

Orientation\Elevation	-13°	-11°	-9°	-7°	-5°	5°	7°	9°	11°	13°
0°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
30°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️
60°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	❌	❌
90°	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️	✔️

To make an initial policy:

As explained in the paper, we take a guided learning approach in which the initial policy plays a crucial role. To train your own initial policy, run the following command:

    python create_initial_policy.py --policyName filename

This saves the initial policy as filename.npy in the initial_policies folder. This file is loaded later as the initial policy when you train your own policy. The same folder also contains a few ready-made initial policies that can be used directly to start the ARS training.
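
To make the role of the .npy file concrete, here is a minimal sketch of a save and load round trip followed by one linear-policy evaluation. It is an illustration only: the README does not state the shape or contents of the saved array, so the 2-D weight matrix, the dimensions, and the function name below are assumptions.

```python
import os
import tempfile

import numpy as np

# Hypothetical dimensions, chosen only for illustration.
OBS_DIM, ACTION_DIM = 4, 2


def linear_policy_action(weights, observation):
    """A linear policy maps an observation to an action with one matrix product."""
    return weights @ observation


# Save and reload a stand-in weight matrix to mimic the .npy round trip.
with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "filename.npy")
    np.save(path, np.ones((ACTION_DIM, OBS_DIM)))
    weights = np.load(path)

observation = np.array([0.1, 0.2, 0.3, 0.4])
action = linear_policy_action(weights, observation)
print(weights.shape, action.shape)
```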

Note: The initial policies are by default saved in the initial_policies folder.

To train the linear policy with ARS

This is where the ARS training starts, with the initial policy trained in the previous step.

    python train_policy.py

The above command starts the training with the default hyperparameters and experiment settings, which are the best we have found so far. The following parameters can be customized as desired:

Parameter	About	Type
--render	flag for rendering	bool
--policy	initial policy to start the training with	str
--logdir	Directory root to log policy files (npy)	str
--lr	learning rate	float
--noise	amount of random noise to be added to the weights	float
--msg	any message accompanying the training	str
--curi_learn	Number of learning iterations before changing the curriculum	int
--eval_step	Number of policy iterations before a policy update	int
--episode_length	Horizon of an episode	int
--domain_Rand	randomize the dynamics of the environment while training	int (only 0 or 1)
--domain_rand	set domain randomization	int

For example,

    python train_policy.py --lr 0.05 --noise 0.04 --logdir testDir --policy init_policy_TS.npy --msg "Training with some parameters" --episode_length 400
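
When trying several learning rates or noise levels, the command lines can be generated rather than typed by hand. The sketch below only builds argument lists from flags listed in the table; the function names and the log directory naming scheme are hypothetical, and nothing is launched.

```python
import itertools
import shlex


def build_train_command(lr, noise, logdir, policy="init_policy_TS.npy"):
    """Assemble one train_policy.py invocation as an argument list."""
    return [
        "python", "train_policy.py",
        "--lr", str(lr),
        "--noise", str(noise),
        "--logdir", logdir,
        "--policy", policy,
    ]


def sweep_commands(learning_rates, noises):
    """Build one command per (learning rate, noise) pair."""
    commands = []
    for lr, noise in itertools.product(learning_rates, noises):
        # Hypothetical naming scheme so that each run logs to its own folder.
        logdir = f"sweep_lr{lr}_noise{noise}"
        commands.append(build_train_command(lr, noise, logdir))
    return commands


for command in sweep_commands([0.05, 0.1], [0.04]):
    print(" ".join(shlex.quote(part) for part in command))
```

Each list can be passed to `subprocess.run(command, check=True)` to start the corresponding training run.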

Note:

The initial policies are by default loaded from the initial_policies folder and the log directory is saved inside the experiments folder.
There are a few other minor parameters that need not be changed for training. For more information about the parameters, run
```
 python train_policy.py --help
```
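
For readers unfamiliar with ARS (Augmented Random Search), the sketch below shows one basic update step on a toy problem, to illustrate what the learning rate and noise parameters control. It is a generic illustration and not the code in train_policy.py: it omits refinements such as keeping only the best directions, assumes that --lr and --noise play the roles of `lr` and `noise`, and uses a hypothetical `toy_reward` in place of a simulated episode.

```python
import random
import statistics


def ars_step(theta, reward_fn, lr, noise, num_directions, rng):
    """One basic ARS update on a flat weight vector (no top-direction filtering)."""
    deltas, plus_rewards, minus_rewards = [], [], []
    for _ in range(num_directions):
        delta = [rng.gauss(0.0, 1.0) for _ in theta]
        deltas.append(delta)
        # Evaluate the weights perturbed in both directions along delta.
        plus_rewards.append(reward_fn([t + noise * d for t, d in zip(theta, delta)]))
        minus_rewards.append(reward_fn([t - noise * d for t, d in zip(theta, delta)]))

    # Scale the step by the standard deviation of the collected rewards.
    sigma = statistics.pstdev(plus_rewards + minus_rewards) or 1.0
    scale = lr / (num_directions * sigma)
    new_theta = list(theta)
    for delta, r_plus, r_minus in zip(deltas, plus_rewards, minus_rewards):
        for i, d in enumerate(delta):
            new_theta[i] += scale * (r_plus - r_minus) * d
    return new_theta


def toy_reward(theta, target=(1.0, -1.0, 0.5)):
    """Hypothetical stand-in for an episode return: higher when theta is near target."""
    return -sum((t - g) ** 2 for t, g in zip(theta, target))


rng = random.Random(0)
theta = [0.0, 0.0, 0.0]
initial_reward = toy_reward(theta)
for _ in range(100):
    theta = ars_step(theta, toy_reward, lr=0.05, noise=0.04, num_directions=8, rng=rng)
final_reward = toy_reward(theta)
print(initial_reward, final_reward)
```

A larger `noise` explores weights further from the current policy, while `lr` sets how far the weights move along the estimated improvement direction.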

To conduct tests on a policy

To run a policy in default conditions, the following command is to be used.

    python test_policy.py

The following test parameters can be changed while testing the policy:

Parameter	About	Type	Allowed values	Unit
--PolicyDir	directory of the policy to be tested	str	(check the experiments folder)	-
--Stairs	load staircase	bool	True or False	unitless
--WedgeIncline	the elevation angle of the wedge	int	0, 5, 7, 9, 11, 13	degrees (°)
--WedgeOrientation	the yaw angle of the wedge about the world z axis	float	-90.0 to 90.0	degrees (°)
--EpisodeLength	number of gait steps in an episode	int	0 to inf	number of steps
--MotorStrength	maximum motor strength that can be applied	float	5.0 to 8.0	newton-metres (N·m)
--FrictionCoeff	coefficient of friction to be set	float	0.55 to 0.80	unitless
--FrontMass	mass to be loaded on the front half of the body	float	0.0 to 0.15	kilograms (kg)
--BackMass	mass to be loaded on the rear half of the body	float	0.0 to 0.15	kilograms (kg)
--RandomTest	flag to activate random sampling	bool	True or False	unitless
--seed	seed for random sampling	int	-	unitless
--PerturbForce	perturbation force to be applied perpendicular to the heading direction of the robot	float	-120 to 120	newtons (N)
--AddImuNoise	flag to add noise to the IMU readings	bool	True or False	unitless
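
The allowed values in the table can be checked before launching a test. The following is a small sketch with hypothetical names; it only encodes the numeric ranges listed above and does not reflect any validation that test_policy.py itself may perform.

```python
# Allowed values copied from the parameter table.
ALLOWED_RANGES = {
    "WedgeOrientation": (-90.0, 90.0),
    "MotorStrength": (5.0, 8.0),
    "FrictionCoeff": (0.55, 0.80),
    "FrontMass": (0.0, 0.15),
    "BackMass": (0.0, 0.15),
    "PerturbForce": (-120.0, 120.0),
}
ALLOWED_WEDGE_INCLINES = (0, 5, 7, 9, 11, 13)


def find_violations(settings):
    """Return one message per setting that falls outside the documented values."""
    problems = []
    for name, value in settings.items():
        if name == "WedgeIncline":
            if value not in ALLOWED_WEDGE_INCLINES:
                problems.append(f"WedgeIncline={value} not in {ALLOWED_WEDGE_INCLINES}")
        elif name in ALLOWED_RANGES:
            low, high = ALLOWED_RANGES[name]
            if not low <= value <= high:
                problems.append(f"{name}={value} outside [{low}, {high}]")
    return problems


print(find_violations({"WedgeIncline": 6, "FrictionCoeff": 0.9}))
```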

For a custom test:

    python test_policy.py --PolicyDir 23July3 --WedgeIncline 11 --WedgeOrientation 15 --FrontMass 0.1 --FrictionCoeff 0.6

For a random test:

    python test_policy.py --PolicyDir 23July3 --RandomTest True --seed 100
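
The purpose of the seed is reproducibility: the same seed should give the same sampled test conditions. The sketch below illustrates the idea with Python's standard `random` module. Which parameters test_policy.py actually randomizes, and from which distributions, is not stated here, so the choices below are assumptions based on the allowed ranges in the table.

```python
import random


def sample_test_conditions(seed):
    """Draw one set of test conditions; the same seed gives the same draw."""
    rng = random.Random(seed)
    return {
        "WedgeIncline": rng.choice([0, 5, 7, 9, 11, 13]),
        "WedgeOrientation": rng.uniform(-90.0, 90.0),
        "MotorStrength": rng.uniform(5.0, 8.0),
        "FrictionCoeff": rng.uniform(0.55, 0.80),
        "FrontMass": rng.uniform(0.0, 0.15),
        "BackMass": rng.uniform(0.0, 0.15),
    }


first = sample_test_conditions(100)
second = sample_test_conditions(100)
print(first == second)  # prints True, since both draws use the same seed
```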

To conduct tests on a staircase

To run a policy on a staircase of fixed dimensions, the following command is to be used.

    python test_policy.py --Stairs True

To conduct tests on arbitrary slopes

To run a policy on an arbitrary slope track, the following command is to be used.

    python arbitary_slope_test.py

Note:

By default, test policies are loaded from the path experiments/given_logdir_name/iterations/best_policy.npy. If no policy directory is specified, the best policy pre-trained by us is loaded.
To load policies from other directories, you might have to change the path inside the test_policy.py file.
In our method we train only for positive roll and negative pitch conditions of the support plane; the trained policy is able to generalize to other conditions too.
Our environment does not fully support training in the downhill case, but you can evaluate a policy in downhill conditions.
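
As a quick illustration of the default path layout described in the first note, the sketch below assembles the location of a best policy file from a log directory name. The helper name is hypothetical, and it assumes that the value passed to --PolicyDir is the same name that was given to --logdir during training.

```python
from pathlib import PurePosixPath


def best_policy_path(logdir_name, root="experiments"):
    """Build experiments/<logdir_name>/iterations/best_policy.npy."""
    return PurePosixPath(root) / logdir_name / "iterations" / "best_policy.npy"


print(best_policy_path("23July3"))  # prints experiments/23July3/iterations/best_policy.npy
```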
