alphazero-solver

Implementation of the AlphaZero algorithm for qubic - (3 Dimensional Tic Tac Toe) and connect4.

Authors: Sasidharan Mahalingam (Overall framework integration, modyfing and fixes bugs in the alpha zero implementation) Rafael Espericueta (Implementation of the Simple Heuristic Agent) Eliana Stefani (Implementation of MiniMax Agent)

Packages Required: Python - 3.5 or later Tensorflow - 1.12 or later (Tensorflow-gpu with a working GPU accelarator recommended) Atleast 400 GB of free space on disk recommeded to saved the models trained

Instructions to Train an AlphaZero Agent for Connect4: python main.py --game connect4

Instructions to Train an AlphaZero Agent for Qubic: python main.py --game qubic

Instructions to run the trained model for connect4: python pit.py -p -o (random, heuristic, minimax, alphazero, human)

Instructions to run the trained model for qubic: python pit_qubic.py -p -o (random, heuristic, minimax, alphazero, human)

Acknowledgement:
The framework is based on the implementation of Surag Nair for the game Othello

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
__pycache__		__pycache__
connect4		connect4
othello		othello
pytorch_classification		pytorch_classification
qubic		qubic
Arena.py		Arena.py
Coach.py		Coach.py
Game.py		Game.py
LICENSE		LICENSE
MCTS.py		MCTS.py
NeuralNet.py		NeuralNet.py
README.md		README.md
main.py		main.py
pit.py		pit.py
pit_qubic.py		pit_qubic.py
setup_env.sh		setup_env.sh
utils.py		utils.py

License

sasidharan-m/qubic-solver

Folders and files

Latest commit

History

Repository files navigation

alphazero-solver

About

Resources

License

Stars

Watchers

Forks

Languages