Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence 2019
DOI: 10.24963/ijcai.2019/772
|View full text |Cite
|
Sign up to set email alerts
|

Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search

Abstract: We propose an approach to general subgoal-based temporal abstraction in MCTS. Our approach approximates a set of available macro-actions locally for each state only requiring a generative model and a subgoal predicate. For that, we modify the expansion step of MCTS to automatically discover and optimize macro-actions that lead to subgoals. We empirically evaluate the effectiveness, computational efficiency and robustness of our approach w.r.t. different parameter settings in two benchmark domains and compare t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 7 publications
0
10
0
Order By: Relevance
“…Different action abstraction Fig. 3 Representation of options within a tree methods can be found in Gabor et al (2019), Moraes et al (2018), and Uriarte and Ontanón (2014).…”
Section: Action Reductionmentioning
confidence: 99%
“…Different action abstraction Fig. 3 Representation of options within a tree methods can be found in Gabor et al (2019), Moraes et al (2018), and Uriarte and Ontanón (2014).…”
Section: Action Reductionmentioning
confidence: 99%
“…Endowing MCTS with both techniques provides more accurate value estimation. Different action abstraction methods can be found in (Gabor et al, 2019;Moraes et al, 2018;Uriarte and Ontanón, 2014).…”
Section: Action Reductionmentioning
confidence: 99%
“…Existing approaches based on more powerful subgoal search methods, on the other hand, have their limitations. [9] is perhaps the closest to our method and uses an MCTS to search the subgoal-induced graph. However, it uses a predefined (not learned) predicate function as a subgoal generator, limiting applicability to the problems with available high-quality heuristics.…”
Section: Related Workmentioning
confidence: 99%