gzero/galvanise_zero

gzero provides a framework for neural networks to learn solely based on self play. This is largely based off Deepmind's papers on AlphaGo, AlphaGo Zero and AlphaZero, as well as the excellent exIT paper, and a number of open source projects were inspirational.

The name gzero stems from the fact that this project was initially a spin off my galvanise player in GGP.

Status January 2019

All games are written in GDL unless otherwise stated. There is no game specific code other than a single python file describing mappings for policy and state (see here for more information).

Here are the games that were trained for more than five days (on a couple of GPUs).

international draughts (statemachine in c++)
go(baduk) 9x9 (no super ko, statemachine in c++)
chess (with no 50 rule)
connect 6
amazons
hex (11 and 13 board sizes)
reversi (8 and 10 board sizes)
breakthrough

Amazons and Breakthrough models were strong enough to win gold medals at ICGA 2018 Computer Olympiad. 👏 👏 Also current LG Champion in Breakthrough.

Reversi is also strong relative to humans on LG, yet performs a bit worse than top AB programs (about ntest level 20 the last time I tested).

Hex/Connect6 play around somewhere top human level on LG, and are currently in the top tier Championships.

Chess and Baduk 9x9 are reasonably strong for the little time they were trained. Baduk 9x9 had a rating 2560 elo on CGOS after about a week of training. Chess was harder to test due to not having 50 rule, but somewhere about 2200-2600 elo would be a decent guess.

Also, Chess and Connect6 "cheated" as experimented with adding data from historical games as well as the self play data.

All the models can (eventually) be found here.

The code is in fairly good shape, but could do with some refactoring and documentation (especially a how to guide on how to train a game). It would definitely be good to have an extra pair of eyes on it. I'll welcome and support anyone willing to try training a game for themselves. Some notes:

python is 2.7
requires a GPU/tensorflow
good starting point is https://github.com/richemslie/ggp-zero/blob/dev/src/ggpzero/defs
the self play method is very different from A0, and not documented anywhere. the code is here: https://github.com/richemslie/ggp-zero/blob/dev/src/cpp/selfplay.cpp
cpp puct/puct2 really needs to be combined.

Little Golem

Most trained games are available to play on Little Golem website. Send an invite to play gzero_bot.

project goal(s)

The initial goal of this project was to be able to train any game in GGP ecosystem, to play at a higher level than the state of the art GGP players, given some (relatively) small time frame to train the game (a few days to a week, on commodity hardware - and not months/years worth of training time on hundreds of GPUs).

Some game types which would be interesting to try:

non-zero sum games (such as the non zero sum variant of Skirmish)
multiplayer games (games with > 2 players)
games that are not easily represented as a 2D array of channels
simultaneous games

Related repos (will be merged eventually here)

ggpzero is extension of ggplib
Custom games, game specific code (*) can be found here

(*) Most game specific game is for testing purposes, printing the board to console, or connecting to platforms/programs, such as GTP in go and UCI in chess. State machines for go(Baduk) and International Draughts are written in C++.

Name		Name	Last commit message	Last commit date
Latest commit History 488 Commits
bin		bin
doc		doc
src		src
.gitignore		.gitignore
LICENSE		LICENSE
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bin

bin

doc

doc

src

src

.gitignore

.gitignore

LICENSE

LICENSE

readme.md

readme.md

Repository files navigation

gzero/galvanise_zero

Status January 2019

Little Golem

project goal(s)

Related repos (will be merged eventually here)

About

Releases

Packages

Languages

License

awesome-archive/galvanise_zero

Folders and files

Latest commit

History

Repository files navigation

gzero/galvanise_zero

Status January 2019

Little Golem

project goal(s)

Related repos (will be merged eventually here)

About

Resources

License

Stars

Watchers

Forks

Languages