DOI: 10.1007/978-3-540-87559-8_11
|View full text |Cite
|
Sign up to set email alerts
|

Mimicking Go Experts with Convolutional Neural Networks

Abstract: Abstract. Building a strong computer Go player is a longstanding open problem. In this paper we consider the related problem of predicting the moves made by Go experts in professional games. The ability to predict experts' moves is useful, because it can, in principle, be used to narrow the search done by a computer Go player. We applied an ensemble of convolutional neural networks to this problem. Our main result is that the ensemble learns to predict 36.9% of the moves made in test expert Go games, improving… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(22 citation statements)
references
References 15 publications
0
22
0
Order By: Relevance
“…Most move predictors for Go are either using Neural Networks [6,7] or are estimating ratings for moves using the Bradley Terry (BT) model or related models [3,4,8]. Latter mentioned approaches model each move decision as a competition between players, the move chosen by the human expert player is then the winning player and its value is updated accordingly.…”
Section: Related Workmentioning
confidence: 99%
“…Most move predictors for Go are either using Neural Networks [6,7] or are estimating ratings for moves using the Bradley Terry (BT) model or related models [3,4,8]. Latter mentioned approaches model each move decision as a competition between players, the move chosen by the human expert player is then the winning player and its value is updated accordingly.…”
Section: Related Workmentioning
confidence: 99%
“…Finally, two-layer neural networks were constructed, and the accuracy of the evaluation was as high as 25%. In 2008, Sutskever and Nair [33] also constructed a two-layer neural network and used the soft Max layer in the last layer to predict moves. The prediction accuracy corresponded to 37% on GoGoGoD dataset.…”
Section: Related Workmentioning
confidence: 99%
“…The game of GO is a very popular research framework and some methods incorporate knowledge from records of expert players. Sutskever and Nair [7] train a Convolutional Neural Network over professional games in order to predict how experts play the game, beating the state of the art. Other approaches [16] use a complex combination of online learning, transient learning, expert knowledge and patterns learned offline.…”
Section: B Similar Models For Learning In Complex Gamesmentioning
confidence: 99%
“…Yet, many agents created to date are the result of extensively analysing what a human would do when faced with the same set of options. There are many techniques for using human knowledge, including: supervised learning through human feedback [6], modelling human opponents to guide the sampling [7], offline learning of a policy from expert play [8], inverse reinforcement learning from human demonstration [9] and writing hand-crafted heuristics based on expert advice [10] (this is also used to inform different search algorithms [5]). These have generally increased the level of play of the deployed agent, but either require a lot of development work or are expensive to run.…”
Section: Introductionmentioning
confidence: 99%