A collection of implementations of various agents used in the paper Thinking Fast and Slow with Deep Learning and Tree Search . Currently features the following agents:
- a MCTS agent
- a CNN based agent that approximates a MCTS agent with 1000 simulations thinking time
All agents run in the minihex environment.