2012
DOI: 10.1109/tciaig.2012.2186810
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Monte Carlo Tree Search Methods

Abstract: Monte Carlo Tree Search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
1,634
0
7

Year Published

2014
2014
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 2,314 publications
(1,644 citation statements)
references
References 126 publications
3
1,634
0
7
Order By: Relevance
“…UCT (Upper Confidence bounds applied to Trees) [8] is a state-of-the-art anytime algorithm that combines MCTS [3] with multi-bandit selection methods [8], and has been utilised for planning in domains pervaded by uncertainty. UCT [5] allows to quickly return a non-trivial decision after performing a series of rollouts in which outcomes of actions are sampled based on their probability.…”
Section: Preliminariesmentioning
confidence: 99%
See 2 more Smart Citations
“…UCT (Upper Confidence bounds applied to Trees) [8] is a state-of-the-art anytime algorithm that combines MCTS [3] with multi-bandit selection methods [8], and has been utilised for planning in domains pervaded by uncertainty. UCT [5] allows to quickly return a non-trivial decision after performing a series of rollouts in which outcomes of actions are sampled based on their probability.…”
Section: Preliminariesmentioning
confidence: 99%
“…New decision nodes for the outcomes of the newly generated chance node are also added to the tree. The subsequent rollout phase of UCT is modified so that, at each iteration of the algorithm, a number r of rollouts are carried out for each non-terminal outcome 3 . This allows to quickly obtain accurate reward estimates for the state from which rollouts are being currently performed, as well as thoroughly exploring the different courses of action available from each outcome.…”
Section: Individual Planningmentioning
confidence: 99%
See 1 more Smart Citation
“…Children of the node are scored with different methods. The most common one is the Upper Confidence Bound (UCB) score [20],…”
Section: Methodsmentioning
confidence: 99%
“…The significant success of Monte Carlo tree search (MCTS) [20] in computer Go game [14] inspired researchers to develop similar approaches in different research areas including other type of games [21][22][23][24]. MCTS is a guided-random best-first search method that models the search space as a gradually expanded tree.…”
Section: Monte Carlo Tree Searchmentioning
confidence: 99%