2021
DOI: 10.1609/socs.v12i1.18545
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Learned Policies in Focal Search

Abstract: Recent machine-learning approaches to deterministic search and domain-independent planning employ policy learning to speed up search. Unfortunately, when attempting to solve a search problem by successively applying a policy, no guarantees can be given on solution quality. The problem of how to effectively use a learned policy within a bounded-suboptimal search algorithm remains largely as an open question. In this paper, we propose various ways in which such policies can be integrated into Focal Search, assum… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…More formally, if s is a state in FOCAL, at any point during the execution of FS, its priority is given by disc(path(s)). FDS has been proposed recently in the context of learned policies (Araneda, Greco, and Baier 2021), in which the notion of discrepancy is not defined in terms of a heuristic but rather in terms of a policy.…”
Section: Focal Discrepancy Search For Learned Heuristicsmentioning
confidence: 99%
See 2 more Smart Citations
“…More formally, if s is a state in FOCAL, at any point during the execution of FS, its priority is given by disc(path(s)). FDS has been proposed recently in the context of learned policies (Araneda, Greco, and Baier 2021), in which the notion of discrepancy is not defined in terms of a heuristic but rather in terms of a policy.…”
Section: Focal Discrepancy Search For Learned Heuristicsmentioning
confidence: 99%
“…Spies et al (2019) proposed to use the learned heuristic to sort FOCAL. Araneda, Greco, and Baier (2021) proposed an approach to exploit learned policies, rather than a heuristic, in the context of FS. They show that an effective way to incorporate policies in FS is by exploiting the notion of discrepancy (Harvey and Ginsberg 1995), which given a stateaction sequence s 0 a 0 s 1 a 1 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Focal Discrepancy Search (FDS) (Araneda, Greco, and Baier 2021;Greco, Araneda, and Baier 2022) is a version of Focal Search which sorts FOCAL by the discrepancy associated with the path of each state. More formally, if s is a state in FOCAL, at any point during the execution of FS, its priority is given by disc(path(s)), which is the number of times along the path where the state with the best heuristic value was not selected for expansion.…”
Section: Focal Discrepancy Searchmentioning
confidence: 99%
“…BWAS does not provide suboptimality guarantees, as a consequence of a second drawback of neural-net heuristics: they are not admissible, thus they cannot be directly used by the many search algorithms that exploit admissibility to provide quality guarantees. Recently, Spies et al [2019] and Araneda, Greco, and Baier;Greco, Araneda, and Baier [2021;2022] proposed the use of neural-net heuristics in combination with admissible heuristics in Focal Search (FS). However, these approaches do not address the problem of slow heuristic computation.…”
Section: Introductionmentioning
confidence: 99%