2020 IEEE International Conference on Robotics and Automation (ICRA) 2020
DOI: 10.1109/icra40945.2020.9197014
|View full text |Cite
|
Sign up to set email alerts
|

Posterior Sampling for Anytime Motion Planning on Graphs with Expensive-to-Evaluate Edges

Abstract: Collision checking is a computational bottleneck in motion planning, requiring lazy algorithms that explicitly reason about when to perform this computation. Optimism in the face of collision uncertainty minimizes the number of checks before finding the shortest path. However, this may take a prohibitively long time to compute, with no other feasible paths discovered during this period. For many real-time applications, we instead demand strong anytime performance, defined as minimizing the cumulative lengths o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 34 publications
(39 reference statements)
0
10
0
Order By: Relevance
“…If the goal is not reached the robot receives some large negative reward.2. Observations ψ are shared and the obstacles remain fixed between iterations.This Repeated BTP is analogous to the Experienced Lazy Path Search problem, for which Thompson sampling (within an algorithm called PSMP) has bounded regret compared to the optimal policy always taking the shortest path (Hou et al 2020). Consider the strategy that attempts the path from Thompson sampling, and if a collision occurs backtracks to the start and executes the shortest path found so far.…”
Section: A1 Heuristic Estimates Of Q-valuesmentioning
confidence: 99%
“…If the goal is not reached the robot receives some large negative reward.2. Observations ψ are shared and the obstacles remain fixed between iterations.This Repeated BTP is analogous to the Experienced Lazy Path Search problem, for which Thompson sampling (within an algorithm called PSMP) has bounded regret compared to the optimal policy always taking the shortest path (Hou et al 2020). Consider the strategy that attempts the path from Thompson sampling, and if a collision occurs backtracks to the start and executes the shortest path found so far.…”
Section: A1 Heuristic Estimates Of Q-valuesmentioning
confidence: 99%
“…While (Cohen, Phillips, and Likhachev 2015;Haghtalab et al 2018;Mandalika et al 2019) are designed to search directly for the optimal path, our Algorithm 1 is an Anytime algorithm. Note that PSMP (Hou et al 2020) is also based on LAZYSP and is an anytime algorithm. However, it is not related to our problem.…”
Section: Related Workmentioning
confidence: 99%
“…• Identifying samples that are guaranteed to be valid (Bialkowski et al, 2016;Bialkowski et al, 2013) • Using a learned model in place of a collision detector (Burns and Brock, 2005;Huh and Lee, 2016;Das and Yip, 2020;Kew et al, 2020;Yu and Gao, 2021) • Determining the order in which to collision check nodes or edges (Pan et al, 2013;Bhardwaj et al, 2019;Choudhury et al, 2017;Choudhury et al, 2018;Hou et al, 2020)…”
Section: Categories Of Work On Collision Checkingmentioning
confidence: 99%
“…Posterior Sampling for Motion Planning (PSMP) (Hou et al, 2020) formulates anytime search on graphs as an instance of Bayesian Reinforcement Learning (Bayesian RL). Unlike prior work, PSMP aims for anytime performance by leveraging learned posteriors on edge collisions to quickly discover an initial feasible path and progressively yield shorter paths.…”
Section: Determining the Order In Which To Collision Check Nodes/edgesmentioning
confidence: 99%