Cow Herding Final

1. Requirements:

Please see requirements.txt for full output of my virtual environment as reported by pip freeze. All packages may not be necessary.

mesa - https://github.com/projectmesa/mesa (pip install git+https://github.com/projectmesa/mesa)

numpy

dill

python 3.6.4 via Anaconda

2. Runnable files:

Output Notebook - A Jupyter notebook where you can easily change the number of episodes and steps per episode to see the resulting output.

run.py - Runs a visualization in the web browser. Probably the best way to see the agents at work. Change the number/type of agents at the bottom of server.py where it says ### Change number of agents here! ###. For best results, please keep teams of two agents since that's what the Q table is trained on.

collect_data.py - This is what I used to collect my data. Runs 3000 episodes with a team of two Random Agents, 3000 episodes with a team of Plan Agents, and 3000 episodes with a team of two Monte Carlo agents. Exports data to csv.

batch_run.py - DO NOT RUN THIS UNLESS YOU WANT TO RETRAIN THE MONTE CARLO AGENTS Overwrites previously trained Q table and begins the

training process over. Trains a team of two Monte Carlo agents for 25,000 episodes, each with 500 timesteps.

3. Other files:

Research and results writeup: https://github.com/NujjA/Cow-Herding-Final/blob/master/Reinforcement%20Learning%20in%20a%20Dynamic%20Multi-Agent%20Environment.pdf

montecarlo.py - Agents that perform Monte Carlo learning and build a Q table

trained_mc_agent.py - Agents that use a pre-trained Q table to select actions

random_agent.py - Randomly moving agents

cow_agent.py - Cow Agent behavior

plan_agent.py - Plan following agents

cow_methods.py - Helper methods for cow behavior and interaction

movement_control.py - Agent motion helper methods

rl_methods.py - Helper methods for reinforcement learning

model.py - Builds the environment and places agents in it. Runs the simulation time.

mc_q_save.pkl, mc_q_save.npy - Saved/Trained Q table

wall.py - Wall agents that make up the corral

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
.gitattributes		.gitattributes
18_07_05_10_14_save.csv		18_07_05_10_14_save.csv
18_07_05_10_14_save.xlsx		18_07_05_10_14_save.xlsx
18_07_08_12_43_save.csv		18_07_08_12_43_save.csv
OutputNotebook.ipynb		OutputNotebook.ipynb
README.md		README.md
Reinforcement Learning in a Dynamic Multi-Agent Environment.pdf		Reinforcement Learning in a Dynamic Multi-Agent Environment.pdf
batch_run.py		batch_run.py
collect_data.py		collect_data.py
cow_agent.py		cow_agent.py
cow_methods.py		cow_methods.py
mc_q_save.npy		mc_q_save.npy
mc_q_save.pkl		mc_q_save.pkl
model.py		model.py
montecarlo.py		montecarlo.py
movement_control.py		movement_control.py
plan_agent.py		plan_agent.py
proposal.pdf		proposal.pdf
random_agent.py		random_agent.py
requirements.txt		requirements.txt
rl_methods.py		rl_methods.py
run.py		run.py
server.py		server.py
td_agent.py		td_agent.py
trained_mc_agent.py		trained_mc_agent.py
wall.py		wall.py

NujjA/Cow-Herding-Final

Folders and files

Latest commit

History

Repository files navigation

Cow Herding Final

1. Requirements:

2. Runnable files:

3. Other files:

About

Resources

Stars

Watchers

Forks

Languages