2019
DOI: 10.48550/arxiv.1902.04522
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(14 citation statements)
references
References 0 publications
0
13
0
Order By: Relevance
“…It is not clear a priori whether the approach scales down in terms of resources. Although cheaper in several ways, the agents in (Silver et al 2018), (Schrittwieser et al 2019), (Tian et al 2019), and(Lee et al 2019) still use thousands of GPU's or hundreds of TPU's to master board games. The recent KataGo (Wu 2019) reaches the level of ELF using 1/50 of the computation and implements several techniques to accelerate the learning.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It is not clear a priori whether the approach scales down in terms of resources. Although cheaper in several ways, the agents in (Silver et al 2018), (Schrittwieser et al 2019), (Tian et al 2019), and(Lee et al 2019) still use thousands of GPU's or hundreds of TPU's to master board games. The recent KataGo (Wu 2019) reaches the level of ELF using 1/50 of the computation and implements several techniques to accelerate the learning.…”
Section: Related Workmentioning
confidence: 99%
“…The amount of computational and financial resources that were required was so huge as to be out of reach for most, if not all, academic institutions. Not coincidentally these well-endowed projects and their follow-ups took place within giant multinational corporations of the IT sector (Tian et al 2019;Lee et al 2019). These companies deployed GPU's by the thousands and hundreds of TPU's.…”
Section: Introductionmentioning
confidence: 99%
“…While our implementation was heavily influenced by several different open-source AlphaZero implementations [22], [33]- [35], our unusual use-case -training small agents on small boards -lead to some unusual design decisions.…”
Section: A Alphazero Implementationmentioning
confidence: 99%
“…1) Small networks: The original AlphaZero and its opensource replications used very large residual convnets. ELF OpenGo [35], for example, uses a 256-filter 20-block convolutional network, weighing in at roughly 20m parameters and 2 GF-s for a forward pass on a single sample. In our preliminary work however, we found that on the small boards we work with, far smaller -and faster -networks could make it to perfect play.…”
Section: A Alphazero Implementationmentioning
confidence: 99%
“…Recent advances in deep reinforcement learning (RL) have given rise to systems that can outperform human experts at variety of games (Silver et al, 2017;Tian et al, 2019;OpenAI, 2018). These advances, even more-so than those from supervised learning, rely on significant numbers of training samples, making them impractical without large-scale, distributed parallelization.…”
Section: Introductionmentioning
confidence: 99%