19*19 Go board ^[1]

The game of Go has attracted game researchers and programmers as an ambitious AI-challenge. Albert Zobrist was a pioneer, who wrote the first Go program in 1968 as part of his Ph.D. Thesis on pattern recognition ^[2]. Chess programmers, beside others, Rémi Coulom and Gian-Carlo Pascutto became successful Go programmers with their programs CrazyStone and Leela respectively. Competitive computer Go, as organized by the ICGA ^[3], is played on boards with 9x9 as well with default 19x19 grids. Since Go lacks a simple evaluation function mainly based on counting material, attempts to apply similar techniques and algorithms as in chess were less successful. The breakthrough in computer Go was accomplished by Monte-Carlo tree search and deep learning.

Progress

Monte-Carlo Go

After early trials to apply Monte Carlo methods to a Go playing program by Bernd Brügmann in 1993 ^[4], recent developments since the mid 2000s by Bruno Bouzy ^[5], and by Rémi Coulom, who coined the term Monte-Carlo Tree Search ^[6], in conjunction with UCT (Upper Confidence bounds applied to Trees) introduced by Levente Kocsis and Csaba Szepesvári ^[7], led to a breakthrough in computer Go ^[8].

CNNs

As mentioned by Ilya Sutskever and Vinod Nair in 2008 ^[9], convolutional neural networks are well suited for problems with a natural translation invariance, such as object recognition. Go has some translation invariance, because if all the pieces on a hypothetical Go board are shifted to the left, then the best move will also shift (with the exception of pieces that are on the boundary of the board). Many applications of neural networks to Go have already used convolutional neural networks, such as Nicol N. Schraudolph et al. ^[10], Erik van der Werf et al. ^[11], and Markus Enzenberger ^[12], among others.

In 2014, two teams independently investigated whether deep convolutional neural networks ^[13] could be used to directly represent and learn a move evaluation function for the game of Go. Christopher Clark and Amos Storkey trained an 8-layer convolutional neural network by supervised learning from a database of human professional games, which without any search, defeated the traditional search program Gnu Go in 86% of the games ^[14] ^[15] ^[16] ^[17] ^[18]. In their paper Move Evaluation in Go Using Deep Convolutional Neural Networks ^[19], Chris J. Maddison, Aja Huang, Ilya Sutskever, and David Silver report they trained a large 12-layer convolutional neural network in a similar way, to beat Gnu Go in 97% of the games, and matched the performance of a state-of-the-art Monte-Carlo Tree Search that simulates a million positions per move ^[20].

AlphaGo

In 2015, a team affiliated with Google DeepMind around David Silver, Aja Huang, Chris J. Maddison, and Demis Hassabis, supported by Google researchers John Nham and Ilya Sutskever, build a Go playing program dubbed AlphaGo, combining Monte-Carlo tree search with their 12-layer networks ^[21], the “policy network,” to select the next move, the “value network,” to predict the winner of the game. The neural networks were trained on 30 million moves from games played by human experts, until it could predict the human move 57 percent of the time. AlphaGo achieved a huge winning rate against other Go programs, and defeated European Go champion Fan Hui ^[22] in October 2015 with a 5 - 0 score ^[23] On March 9 to 15, 2016, AlphaGo won a $1M 5-game challenge match in Seoul versus Lee Sedol with 4 - 1 ^[24] ^[25] ^[26]. During The Future of Go Summit from May 23 to 27, 2017 in Wuzhen, China, AlphaGo won a three-game match versus current world No. 1 ranking player Ke Ji. After the Summit, AlphaGo is now retired from competitive play while DeepMind continues AI research in other areas ^[27].

AlphaGo Zero & AlphaZero

However, in October 2017, AlphaGo Zero, an evolution of AlphaGo was introduced. While previous versions were initially trained on thousands of human amateur and professional games to learn how to play Go, AlphaGo Zero learns exclusively by playing games against itself, starting from completely random play, to quickly surpass human level of play and defeated the previously published champion-defeating version of AlphaGo by 100 games to 0 ^[28] ^[29]. AlphaGo Zero was further improved and even generalized for other games now dubbed AlphaZero, as published in December 2017 ^[30].

Fine Art

Fine Art is a Go playing entity developed since 2016 under the patronage of the Chinese media company Tencent by a team around Liu Yongsheng, along with Ma Bo, Tang Shanmin, Wu Guangyu, and Zhang Kaixu. It won the Computer Go UEC Cup at the University of Electro-Communications, Chōfu, Tokyo, Japan, in March 2017 against a field of 27 other programs including DeepZenGo and Crazy Stone ^[31]. In January 2018, it defeated Ke Jie 9P in 77 moves after giving two stones handicap ^[32] on Fox Weiqi ^[33] server ^[34].

Quotes

Quote by Gian-Carlo Pascutto in 2010 ^[35]:

There is no significant difference between an alpha-beta search with heavy LMR  and a static evaluator (current state of the art in chess) and an UCT searcher with a small exploration constant that does playouts (state of the art in go).

The shape of the tree they search is very similar. The main breakthrough in Go the last few years was how to backup an uncertain Monte Carlo score. This was solved. For chess this same problem was solved around the time quiescent search was developed.

Both are producing strong programs and we've proven for both the methods that they scale in strength as hardware speed goes up.

So I would say that we've successfully adopted the simple, brute force methods for chess to Go and they already work without increases in computer speed. The increases will make them progressively stronger though, and with further software tweaks they will eventually surpass humans.

Computer Olympiads

Selected Publications

^[36]

1960 ...

Jack Good (1965). The Mystery of Go. Literature: Reports hosted by Atlas Computer Laboratory

1970 ...

Albert Zobrist (1970). Feature Extraction and Representation for Pattern Recognition and the Game of Go. Ph.D. Thesis (152 pp.), University of Wisconsin, also published as technical report, pdf
Walter R. Reitman, James Kerwin, Robert Nado, Judith S. Reitman, Bruce Wilcox (1974). Goals and Plans in a Program for Playing Go. Proceedings of the 29th ACM Conference, reprinted in David Levy (ed.) (1988). Computer Games II. google books
Walter R. Reitman, Bruce Wilcox (1975). Perception and representation of spatial relations in a program for playing Go. Proceedings of the 30th ACM Conference, reprinted in David Levy (ed.) (1988). Computer Games II.
David B. Benson (1976). Life in the Game of Go. Information Sciences, Vol. 10, pdf
Judith S. Reitman (1976). Skilled Perception in Go: Deducing Memory Structures from Inter-Response Times. Cognitive Psychology, Vol. 8, No, 3
Walter R. Reitman, Bruce Wilcox (1977). Pattern Recognition and Pattern-Directed Inference in a Program for Playing Go. ACM SIGART Bulletin, No. 63, reprinted in David Levy (ed.) (1988). Computer Games II.
Editor (1979). Computer GO Has Come. Personal Computing, Vol. 3, No. 5, pp. 55
Walter R. Reitman, Bruce Wilcox (1979). Modelling Tactical Analysis and Problem Solving in Go. Proceedings of the Tenth Annual Pittsburgh Conference on Modelling and Simulation
Walter R. Reitman, Bruce Wilcox (1979). The Structure and Performance of the INTERIM.2 Go Program. IJCAI 1979, reprinted in David Levy (ed.) (1988). Computer Games II.

1980 ...

Editor (1980). The Challenge of Computer Go. Personal Computing, Vol. 4, No. 8, pp. 79
Editor (1980). Zobrist Program in Action. Personal Computing, Vol. 4, No. 8, pp. 81 » Albert Zobrist
Bruce Wilcox (1985). Reflections on building two Go programs. ACM SIGART Bulletin, No. 94
Anders Kierulf (1985). Smart Go Board: Algorithms for the Tactical Calculator. Diploma thesis (unpublished), ETH Zurich.
David Levy (ed.) (1988). Computer Games II. Springer, ISBN: 978-1-4613-8756-5, Chapter 5 - Go
Janusz Kraszek (1988). Heuristics in the life and death algorithm of a Go playing program. Computer Go 9 (Winter 1988-89)
Anders Kierulf, Jürg Nievergelt (1989). Swiss Explorer blunders its way into winning the first computer Go Olympiad. Heuristic Programming in AI 1
Ken Chen (1989). Group Identification in Computer Go. Heuristic Programming in AI 1

1990

Ken Chen (1990). The Move Decision Process of Go Intellect, Computer Go, No.14, pp. 9-17.
Ken Chen, Anders Kierulf, Martin Müller, Jürg Nievergelt (1990). The Design and Evolution of Go Explorer. Computers, Chess, and Cognition
Kiyoshi Shirayanagi (1990). Knowledge Representation and its Refinement in Go Programs. Computers, Chess, and Cognition

1991

David Wolfe (1991). Mathematics of Go: Chilling Corridors. Ph.D. thesis, University of California, Berkeley, advisor Elwyn Berlekamp
Herbert D. Enderton (1991). The Golem Go Program. Technical Report CMU-CS-92-101, Carnegie Mellon University
Ken Chen (1991). Go Intellect wins two Gold Medals. Heuristic Programming in AI 2
Barney Pell (1991). Exploratory Learning in the Game of GO. Heuristic Programming in AI 2
Thomas Wolf (1991). Investigating Tsumego Problems with "RisiKo". Heuristic Programming in AI 2

1992

Warren D. Smith (1992). Hash functions for Binary and Ternary Words. NEC Research Institute, ps
Thomas Wolf (1992). Generating tsume go problems with GoTools. Heuristic Programming in AI 4 ^[37]

1993

David Fotland (1993). Knowledge representation in the Many Faces of Go.
Bernd Brügmann (1993). Monte Carlo Go. pdf ^[38]
Nicol N. Schraudolph, Peter Dayan, Terrence J. Sejnowski (1993). Temporal Difference Learning of Position Evaluation in the Game of Go. NIPS 1993 ^[39]

1994

Elwyn Berlekamp, David Wolfe (1994). Mathematical Go - Chilling Gets the Last Point. A K Peters Ltd., also in paperback as Mathematical Go Endgames: Nightmares For the Professional Go Player. Ishi Press ^[40]

1995 ...

Jay Burmeister, Janet Wiles (1995). CS-TR-339 Computer Go Tech Report. Departments of Computer Science and Psychology, The University of Queensland, QLD 4072, Australia (permanently - under construction)
Jay Burmeister, Janet Wiles (1995). The Challenge of Go as a Domain for AI Research: A Comparison Between Go and Chess. In Proceedings of the Third Australian and New Zealand Conference on Intelligent Information Systems, IEEE Western Australia Section, pdf
Tristan Cazenave (1995). Learning and Problem Solving in Gogol, a Go playing program. pdf

1996

Bruce Wilcox, Sue Wilcox (1996). EZ-GO-Oriental Strategy in a Nutshell. KI Press, ISBN13 978-0965223546 ^[41]
Markus Enzenberger (1996). The Integration of A Priori Knowledge into a Go Playing Neural Network.
Martin Müller, Ralph Gasser (1996). Experiments in Computer Go Endgames. Games of No Chance edited by Richard J. Nowakowski
Martin Müller, Elwyn Berlekamp, Bill Spight (1996). Generalized thermography: Algorithms, implementation, and application to Go endgames. Technical Report 96-030, ICSI Berkeley, 1996. postscript
Bruno Bouzy, Tristan Cazenave (1996). Shared concepts between complex systems and the game of Go. pdf

1997

Tristan Cazenave (1997). Gogol (an Analytical Learning Program). IJCAI'97, pdf
Bruno Bouzy, Tristan Cazenave (1997). Using the Object Oriented Paradigm to Model Context in Computer Go. Context'97, pdf
David Fotland, Atsushi Yoshikawa (1997). The 3rd Fost-Cup World-Open Computer-Go Championship. ICGA Journal, Vol. 20, No. 4