2017
DOI: 10.1613/jair.5328
|View full text |Cite
|
Sign up to set email alerts
|

DESPOT: Online POMDP Planning with Regularization

Abstract: The partially observable Markov decision process (POMDP) provides a principled general framework for planning under uncertainty, but solving POMDPs optimally is computationally intractable, due to the "curse of dimensionality" and the "curse of history". To overcome these challenges, we introduce the Determinized Sparse Partially Observable Tree (DESPOT), a sparse approximation of the standard belief tree, for online planning under uncertainty. A DESPOT focuses online planning on a set of randomly sampled scen… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
134
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 141 publications
(134 citation statements)
references
References 23 publications
0
134
0
Order By: Relevance
“…POMDP planning suffers from the well-known "curse of dimensionality" and "curse of history" [8]: the complexity of planning grows exponentially with the size of the state space and the planning horizon. Two recent belief tree search algorithms, POMCP [9] and DESPOT [10] made online POMDP planning practical for real-world tasks. Both of them use Monte Carlo simulations to sample transitions and observations.…”
Section: A Online Pomdp Planningmentioning
confidence: 99%
“…POMDP planning suffers from the well-known "curse of dimensionality" and "curse of history" [8]: the complexity of planning grows exponentially with the size of the state space and the planning horizon. Two recent belief tree search algorithms, POMCP [9] and DESPOT [10] made online POMDP planning practical for real-world tasks. Both of them use Monte Carlo simulations to sample transitions and observations.…”
Section: A Online Pomdp Planningmentioning
confidence: 99%
“…There are two distinct approaches to planning under uncertainty: offline (e.g., [10,13,17,22]) and online (e.g., [19,21,25]). Offline planning leverages offline computation to reason about all future contingencies in advance and achieves faster execution time online.…”
Section: Background a Online Planning Under Uncertaintymentioning
confidence: 99%
“…For completeness, we provide a brief summary of the DESPOT algorithm. See [25] for details. To overcome the computational challenge of online planning under uncertainty, DESPOT samples a small finite set of K scenarios as representatives of the future.…”
Section: A Despotmentioning
confidence: 99%
See 2 more Smart Citations