GitHub - sjmcleaver/phillip: The SSBM "Phillip" AI.

#The Phillip AI An SSBM player based on Deep Reinforcement Learning.

Requirements

Tested on: Ubuntu >=14.04, OSX. If you want Windows support, go bug the dolphin developers to support MemoryWatcher and Pipe Input on Windows! A fork that supports this is under works.

A recent version of dolphin. Probably need to compile from source on Linux.
Python 3.
Tensorflow 0.11.
A few python packages

pip3 install attrs
Install phillip:

pip3 install -e .

Play

Trained agents are stored in the agents directory.

phillip --gui --human --start 0 --load agents/FalconFalconFD

Train

Training is controlled by phillip/train.py. See also runner.py and launcher.py for training massively in parallel on slurm clusters. Phillip has been trained at the MGHPCC. It is recommended to train with a custom dolphin from https://github.com/vladfi1/dolphin - the below commands will likely fail otherwise.

Local training is also possible. First, edit runner.py with your desired training params (advanced). Then do:

python3 runner.py # will output a path
python3 launcher.py saves/path/ --init --local [--agents number_of_agents] [--log_agents]

To view stats during training:

tensorboard --logdir logs/

The trainer and (optionally) agents redirect their stdout/err to slurm_logs/. To end training:

kill $(cat saves/path/pids)

To resume training run launcher.py again, but omit the --init (it will overwrite your old network).

Support

Come to the Discord!

Recordings

I've been streaming practice play over at http://twitch.tv/x_pilot. There are also some recordings on my youtube channel.

##Credits

Big thanks to https://github.com/altf4/SmashBot for getting me started, and to https://github.com/spxtr/p3 for a python memory watcher. Some code for dolphin interaction has been borrowed from both projects (mostly the latter now that I've switched to pure python).

Name		Name	Last commit message	Last commit date
Latest commit History 767 Commits
agents		agents
enemies		enemies
movies		movies
phillip		phillip
.gitignore		.gitignore
LICENSE		LICENSE
Readme.md		Readme.md
SmashLadderClient.py		SmashLadderClient.py
dolphin.sh		dolphin.sh
instructions.txt		instructions.txt
launcher.py		launcher.py
run.sh		run.sh
runner.py		runner.py
scancel.sh		scancel.sh
setup.py		setup.py
stream.py		stream.py
twitchbot.py		twitchbot.py

License

sjmcleaver/phillip

Folders and files

Latest commit

History

Repository files navigation

Requirements

Play

Train

Support

Recordings

About

Resources

License

Stars

Watchers

Forks

Languages