2022
DOI: 10.1609/icaps.v32i1.19845
Neural Network Heuristic Functions for Classical Planning: Bootstrapping and Comparison to Other Methods

Abstract: How can we train neural network (NN) heuristic functions for classical planning, using only states as the NN input? Prior work addressed this question by (a) per-instance imitation learning and/or (b) per-domain learning. The former limits the approach to instances small enough for training data generation, the latter to domains where the necessary knowledge generalizes across instances. Here we explore three methods for (a) that make training data generation scalable through bootstrapping and approximate val…

Cited by 7 publications (19 citation statements)
References 15 publications
“…The coverage of a planner is defined as the percentage of initial states for which a solution path is found within the given planning budget. Ferber et al. (2021) report observing that in general the coverage superiority between the different NN heuristics tested did not vary over time. That is, the planning time used and the relative coverage superiority between the algorithms were not correlated.…”
Section: Experimental Study
confidence: 92%
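The coverage metric quoted above can be sketched in a few lines. This is a minimal illustration with hypothetical names (`plan`, `budget_seconds` are assumptions, not the authors' code): coverage is simply the fraction of initial states solved within the per-state planning budget.

```python
def coverage(initial_states, plan, budget_seconds):
    """Percentage of initial states for which plan() finds a solution path.

    plan(state, budget) is assumed to return a solution path, or None on
    timeout/failure within the given budget.
    """
    solved = sum(1 for s in initial_states
                 if plan(s, budget_seconds) is not None)
    return 100.0 * solved / len(initial_states)

# Toy usage: a "planner" that only solves even-numbered states.
states = list(range(10))
toy_plan = lambda s, budget: [s] if s % 2 == 0 else None
print(coverage(states, toy_plan, 60))  # 50.0
```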
“…The baseline option for performing the rollout, and the method used by Ferber et al. (2021), is to randomly select actions a for which applying the regression operator is valid. In addition to testing RSL using random action selection, we instantiate a version of RSL we name Novelty-guided Regression-based Learning (N-RSL) that aims to increase the structural diversity of operators selected in its regression.…”
Section: Extracting State Sets Through Regression
confidence: 99%
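The random-rollout baseline described above can be sketched with standard STRIPS regression. This is a hedged illustration with a toy operator set, not the cited implementation: a partial goal is regressed over an operator (pre, add, del) only when the operator is relevant (its add list intersects the goal) and consistent (its delete list does not), yielding (goal − add) ∪ pre; the rollout picks uniformly among the valid operators at each step.

```python
import random

def regression_valid(goal, op):
    # op = (preconditions, add effects, delete effects), each a set of facts.
    pre, add, dele = op
    return bool(add & goal) and not (dele & goal)  # relevant and consistent

def regress(goal, op):
    pre, add, dele = op
    return (goal - add) | pre

def random_rollout(goal, operators, steps, rng):
    """Walk backwards from the goal, choosing random valid operators."""
    states, current = [goal], goal
    for _ in range(steps):
        valid = [op for op in operators if regression_valid(current, op)]
        if not valid:
            break  # no operator regresses validly; stop the rollout
        current = regress(current, rng.choice(valid))
        states.append(current)
    return states  # partial states usable as training inputs

# Toy domain (hypothetical facts): a single move from loc1 to loc2.
ops = [({"at_a_loc1"}, {"at_a_loc2"}, {"at_a_loc1"})]
print(random_rollout({"at_a_loc2"}, ops, 3, random.Random(0)))
```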
“…Successes include the AlphaGo series (Silver et al. 2016, 2017, 2018), as well as heuristic search for single-agent games such as Rubik's Cube (Agostinelli et al. 2019). Given the prominence of heuristic search in AI Planning (Hoffmann and Nebel 2001; Helmert and Domshlak 2009; Richter and Westphal 2010; Helmert et al. 2014; Domshlak, Hoffmann, and Katz 2015), training NNs as heuristic functions is highly promising, and is actively pursued (Toyer et al. 2018; Garg, Bajpai, and Mausam 2019; Ferber, Helmert, and Hoffmann 2020; Shen, Trevizan, and Thiébaux 2020; Rivlin, Hazan, and Karpas 2020; Yu, Kuroiwa, and Fukunaga 2020; Karia and Srivastava 2021; Ferber et al. 2022). We contribute a new angle to this line of research, honing in on NN prediction confidence.…”
Section: Introduction
confidence: 99%