
Learning to Play Go

This repository contains the code for our CSC2515 research project "Learning to Play Go". 📜 Report

In this project, we implement an agent that learns to play Go on a $9\times 9$ board through behaviour cloning from human game records, followed by refinement with self-play. Our agent convincingly beats all baselines, including several well-known Go programs:

Our model vs baselines: Top row - our model plays black; Bottom row - our model plays white

Table of Contents

  • Install Dependencies
  • Prepare Data
  • Training
  • Run Engine on Command Line
  • Run Engine on Sabaki
  • Self-play training
  • Acknowledgement

Install Dependencies

You need to install a few Python packages before running the code.
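
A typical setup is sketched below; this is an assumption based on the project's pyDLGO lineage rather than a verified requirements list, and exact versions are unspecified. Horovod is only needed for the distributed self-play step described later.

pip install torch numpy
pip install horovod    # only required for distributed self-play (Step 2 below)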

Prepare Data

Unzip the sgf data.

unzip -q sgf.zip
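
After unzipping, the SGF game records are expected to live under the sgf/ directory, which is the default data_dir used by the training script below.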

Training

python train_behaviour_cloning.py --steps=1280000

Several parameters can be specified:

Parameter        Default   Description
data_dir         "sgf"     Path to the data directory.
steps            400000    Number of training steps.
verbose_step     1000      Interval (in steps) between progress prints.
batch_size       2048      Batch size; recommended to be at least 128.
learning_rate    1e-3      Learning rate.
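
For example, a run that overrides several defaults might look like this, assuming each parameter in the table maps to a command-line flag of the same name (as --steps does in the command shown above):

python train_behaviour_cloning.py --data_dir=sgf --steps=1280000 --batch_size=2048 --learning_rate=1e-3

Conceptually, this behaviour-cloning stage fits the policy network to imitate moves from the human SGF records. The sketch below shows what that objective can look like; it is illustrative only and assumes a PyTorch policy network that outputs logits over the 82 possible moves on a 9×9 board (81 points plus pass). The names policy_net, boards, and moves are hypothetical, not the repository's actual API.

import torch.nn.functional as F

# Hypothetical sketch of the behaviour-cloning objective (not the repository's code).
# policy_net maps a batch of board tensors to logits over 82 moves (81 points + pass).
def behaviour_cloning_step(policy_net, optimizer, boards, moves):
    # boards: (N, C, 9, 9) float tensor; moves: (N,) long tensor of human move indices
    logits = policy_net(boards)              # (N, 82)
    loss = F.cross_entropy(logits, moves)    # imitate the recorded human move
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()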

Run Engine on Command Line

With the model we just trained, we can now deploy it through GTP to play against other Go programs.

We can start our pre-trained engine by:

chmod 777 dlgo.py
./dlgo.py --weights weights-name --playouts 1600 --resign-threshold 0.25

Several parameters can be specified:

Parameter          Default   Description
weights            -         Path to the pre-trained model.
playouts           400       Number of MCTS playouts.
resign-threshold   0.1       Resign if the estimated probability of winning falls below this value.

You can interactively play against it through GTP, e.g. genmove black will generate a move for black.
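
For example, a short interactive session could look like the following. These are standard GTP commands; the exact command set supported by gtp.py may differ, and the engine acknowledges each successful command with a line starting with "=".

boardsize 9
clear_board
play black E5
genmove white
showboard
quit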

Run Engine on Sabaki

Since the code supports GTP, it can be integrated with Go GUIs such as Sabaki, an elegant Go board and SGF editor. Go engines can be added to Sabaki to play offline; Sabaki then acts as a graphical UI for any Go engine that supports the Go Text Protocol (GTP).
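
To add the engine, register it in Sabaki's engine manager (field names and menu location may differ between Sabaki versions) using the script as the executable and the same flags used on the command line, for example:

Path: ./dlgo.py
Arguments: --weights weights-name --playouts 1600 --resign-threshold 0.25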

Self-play training

Step 1: First we need to select the game positions on which MCTS will run. It takes about 10 minutes to dump 100K game positions.

python dump_position_from_datasets.py

Step 2: Run MCTS on the selected positions; this step is very slow. If you want to run it in a distributed setup, you need to install Horovod:

python -m horovod.runner.launch -np 16 python dump_self_play_results.py

It takes about 24 hours with two 1080 Ti GPUs to dump the results.
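
Equivalently, if Horovod's standard horovodrun launcher is on your PATH, the same job can be started with:

horovodrun -np 16 python dump_self_play_results.py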

Step 3: Run refinement on the results. This step takes about 4 hours.

python train_self_play.py
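
This step trains the network on the MCTS results dumped in Step 2. As a rough, hypothetical sketch of what such an update can look like (a common AlphaGo-Zero-style formulation, not necessarily the exact objective in train_self_play.py), the policy is pushed toward the MCTS visit-count distribution for each position:

import torch.nn.functional as F

# Hypothetical sketch: refine the policy toward MCTS visit-count targets.
# mcts_probs: (N, 82) visit-count distributions from the dumped self-play results.
def self_play_refinement_step(policy_net, optimizer, boards, mcts_probs):
    logits = policy_net(boards)                          # (N, 82)
    log_probs = F.log_softmax(logits, dim=1)
    loss = -(mcts_probs * log_probs).sum(dim=1).mean()   # cross-entropy vs. MCTS targets
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()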

Acknowledgement

This project was originally built on top of CGLemon/pyDLGO and includes code from the following open-source projects:

  • CGLemon/pyDLGO
    • mcts.py: Monte Carlo Tree Search implementation
    • gtp.py: Go Text Protocol (GTP) support for playing against other Go programs
  • ymgaq/Pyaq (https://github.com/ymgaq/Pyaq)
    • board.py: Go board representation and operations
  • jtauber/sgf
    • sgf.py: Smart Game Format (SGF) parser for reading game records
