2020 IEEE International Conference on Robotics and Automation (ICRA) 2020
DOI: 10.1109/icra40945.2020.9196898
|View full text |Cite
|
Sign up to set email alerts
|

A Model-Free Approach to Meta-Level Control of Anytime Algorithms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 29 publications
0
4
0
Order By: Relevance
“…However, it is often not feasible to calculate the cost of the optimal solution for complex problems. Like earlier work (Hansen and Zilberstein 2001;Svegliato, Wray, and Zilberstein 2018;Svegliato, Sharma, and Zilberstein 2020), we estimate solution quality as the ratio, q = h(s 0 )/ζ, with h(s 0 ) as the h-value of the initial state s 0 and ζ as the cost of the final solution.…”
Section: Methodsmentioning
confidence: 99%
“…However, it is often not feasible to calculate the cost of the optimal solution for complex problems. Like earlier work (Hansen and Zilberstein 2001;Svegliato, Wray, and Zilberstein 2018;Svegliato, Sharma, and Zilberstein 2020), we estimate solution quality as the ratio, q = h(s 0 )/ζ, with h(s 0 ) as the h-value of the initial state s 0 and ζ as the cost of the final solution.…”
Section: Methodsmentioning
confidence: 99%
“…While fixed allocation is effective given negligible uncertainty in the performance of the anytime algorithm, there is often substantial uncertainty in real-time planning (Paul et al 1991). Hence, a more sophisticated approach, namely monitoring and control, tracks the performance of the algorithm and estimates a stopping point at runtime periodically (Horvitz 1990;Zilberstein and Russell 1995;Hansen and Zilberstein 2001;Lin et al 2015;Svegliato, Wray, and Zilberstein 2018;Svegliato, Sharma, and Zilberstein 2020). Our approach not only determines the stopping point but also tunes the hyperparameters of an anytime algorithm at runtime.…”
Section: Related Workmentioning
confidence: 99%
“…The most related work to ours is by Hansen and Zilberstein [11], where they proposed a dynamic-programming approach to solve the model-based variant of meta-reasoning. They later developed online approaches, such as online performance prediction [10] and RL-based model-free metareasoning [12] in an attempt to remove the necessity of preprocessing and gathering data. They particularly considered the solution quality to be safety, yielding smooth performance profiles.…”
Section: Related Workmentioning
confidence: 99%
“…We also investigate model-free approaches that use the function-approximation capabilities of neural networks to mitigate the curse of dimensionality. As meta-reasoning is a control problem, approximate dynamic programming or reinforcement learning (RL) can be adopted to learn a policy [12]. However, we observe that in our problem, we have access to an oracle for the optimal decision policy for each performance profile in the dataset, simply by letting the motion planner run long enough so that we get diminishing returns and determining, post hoc, the optimal stopping time for each training example.…”
Section: Introductionmentioning
confidence: 99%