wildlifeRL

An attempt at training RL agents to find optimal anti-poaching patrol routes in a simulated wildlife security game. Created during the 2016-2017 school year for USC Teamcore AI Lab.

Overall Objective:

Given a park grid with a certain spatial distribution of animals to protect, train the (anti-poaching) patroller/defender to take the game state (i.e. the park grid) as input and produce an action (i.e. the selected patrol locations) as output. The defender gets a reward when it picks locations that are close to the poachers' locations.
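The proximity-based reward described above could be sketched roughly as follows. This is only an illustration: the function name `patrol_reward`, the use of Manhattan distance, and the `radius` threshold are assumptions, not the reward actually used in this repo.

```python
def patrol_reward(patrol_cells, poacher_cells, radius=1):
    """Count poachers within `radius` of any patrol cell.

    Cells are (row, col) tuples on the park grid. Manhattan distance and
    the `radius` cutoff are illustrative assumptions.
    """
    caught = 0
    for pr, pc in poacher_cells:
        # A poacher counts once if at least one patrol cell is close enough.
        if any(abs(pr - r) + abs(pc - c) <= radius for r, c in patrol_cells):
            caught += 1
    return caught


if __name__ == "__main__":
    patrols = [(0, 0), (2, 2)]
    poachers = [(0, 1), (4, 4)]
    print(patrol_reward(patrols, poachers))
```

A larger `radius` makes the reward denser (easier to earn), which can matter a lot for how quickly a policy gradient method picks up a learning signal.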

Methods Tested:

ConvNet: Model the defender network using a ConvNet from 2D park grid to action vector
Vanilla Policy Gradient: Update the defender network using the game reward as a gradient signal
DDPG (Deep Deterministic Policy Gradient): Model the defender using two complementary neural networks: one actor network (to map from game state to action), and one critic network (to judge the goodness of the action)
Multi-Agent RL: Train reinforcement learning models for both the defenders (i.e. anti-poaching patrollers) and the attackers (i.e. poachers), then observe how the two sides' strategies evolve against each other
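As a rough illustration of the vanilla policy gradient idea listed above (using the game reward as the gradient signal), here is a minimal REINFORCE sketch in plain Python. It is not the code from this repo: it assumes a hypothetical one-step game with a single fixed poacher cell, a softmax policy over grid cells with one logit per cell, and no neural network or baseline.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_step(logits, action, reward, lr=0.1):
    """One REINFORCE update for a softmax policy.

    For a softmax policy, d/d(logit_i) log pi(action) = 1[i == action] - p_i,
    so each logit moves by lr * reward * that term.
    """
    probs = softmax(logits)
    return [
        x + lr * reward * ((1.0 if i == action else 0.0) - p)
        for i, (x, p) in enumerate(zip(logits, probs))
    ]


def train(n_cells=4, poacher_cell=2, episodes=500, lr=0.1, seed=0):
    """Train on a toy game: reward 1 if the patrolled cell is the poacher cell."""
    rng = random.Random(seed)
    logits = [0.0] * n_cells
    for _ in range(episodes):
        probs = softmax(logits)
        action = rng.choices(range(n_cells), weights=probs)[0]
        reward = 1.0 if action == poacher_cell else 0.0
        logits = reinforce_step(logits, action, reward, lr)
    return softmax(logits)


if __name__ == "__main__":
    print(train())
```

In this toy setting the probability mass shifts toward the poacher cell over training. DDPG differs in that a separate critic network estimates the value of an action and that estimate, rather than the raw sampled reward, drives the actor update.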

Sample DDPG Training Chart: ddpg-critic-loss-chart.png

Built With:

Python 3.5 (Anaconda build)
NumPy/SciPy
TensorFlow
Keras

