2021
DOI: 10.1007/978-3-030-89453-5_2

Stabilized Nested Rollout Policy Adaptation

Cited by 9 publications (6 citation statements)
References 22 publications
“…Beam-NRPA (Cazenave and Teytaud 2012) optima. High-Diversity NRPA (HD-NRPA) (Edelkamp and Cazenave 2016) elaborates on this observation to increase the diversity of the beam, so that according to some specification of distance solutions too close to existing ones are removed from the beam.…”
Section: Monte Carlo Search Framework | mentioning
confidence: 99%
“…After that, a new Nested Rollout Policy Adaptation algorithm achieved a new 82 steps record [25]. Thereafter, Cazenave applied Beam Nested Rollout Policy Adaptation [26], which reached the same 82 steps record but did not exceed it, indicating the difficulty of making further progress on Morpion Solitaire using traditional search heuristics.…”
Section: Related Work | mentioning
confidence: 99%
“…A score of 80 moves was found by means of Nested Monte-Carlo search [24]. In addition, [25] found a new record with 82 steps, and [26] also found an 82-step solution. It has been proven mathematically that the 5D version has an upper bound of 121 [27].…”
Section: Morpion Solitaire | mentioning
confidence: 99%
“…Stabilized NRPA [12] is a simple improvement of NRPA. The principle is to play P playouts at level 1 before each call to the adapt function.…”
Section: Stabilized GNRPA | mentioning
confidence: 99%
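
The principle quoted above (play P playouts at level 1 before each call to the adapt function) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy problem (`TARGET`, `K`, `score_of`), the move coding `(step, move)`, and the parameter names `n_iter`, `p_playouts`, and `alpha` are all assumptions made for illustration.

```python
import math
import random

# Hypothetical toy problem: pick one of K moves at each step; the score is
# the number of steps that match a hidden target sequence.
K = 3
TARGET = [2, 0, 1, 1, 2, 0]
LENGTH = len(TARGET)


def score_of(seq):
    return sum(1 for i, m in enumerate(seq) if m == TARGET[i])


def playout(policy):
    """Sample one sequence with softmax probabilities over policy weights."""
    seq = []
    for step in range(LENGTH):
        weights = [math.exp(policy.get((step, m), 0.0)) for m in range(K)]
        seq.append(random.choices(range(K), weights=weights)[0])
    return score_of(seq), seq


def adapt(policy, seq, alpha=1.0):
    """Return a copy of the policy shifted towards the moves of seq."""
    new = dict(policy)
    for step, chosen in enumerate(seq):
        # Probabilities are computed from the policy before adaptation.
        weights = [math.exp(policy.get((step, m), 0.0)) for m in range(K)]
        z = sum(weights)
        for m in range(K):
            new[(step, m)] = new.get((step, m), 0.0) - alpha * weights[m] / z
        new[(step, chosen)] += alpha
    return new


def stabilized_nrpa(level, policy, n_iter=10, p_playouts=4, alpha=1.0):
    """NRPA where level 1 plays p_playouts playouts before each adapt."""
    if level == 0:
        return playout(policy)
    best_score, best_seq = -1, None
    for _ in range(n_iter):
        # Stabilization: only level 1 repeats the lower-level call P times.
        trials = p_playouts if level == 1 else 1
        for _trial in range(trials):
            s, seq = stabilized_nrpa(level - 1, policy, n_iter, p_playouts, alpha)
            if s >= best_score:
                best_score, best_seq = s, seq
        policy = adapt(policy, best_seq, alpha)
    return best_score, best_seq
```

Calling `stabilized_nrpa(2, {})` returns the best score and sequence found. Setting `p_playouts=1` reduces the sketch to plain NRPA, which makes the effect of the extra level-1 playouts easy to compare on the toy problem.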
“…Beam NRPA has already been applied successfully to the TSPTW and to Morpion Solitaire [14]. The best results were obtained using a beam at level 1.…”
Section: Beam GNRPA | mentioning
confidence: 99%