2022
DOI: 10.1287/moor.2020.1116

Finite-Memory Strategies in POMDPs with Long-Run Average Objectives

Abstract: Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the decision maker has approximately optimal strategies with finite memory. This implies notably that approximating the long-run value is recursively enumerable, as well as a weak continuity property of the value with respect to the transition function.
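
To make the notion of a finite-memory strategy concrete, here is a minimal sketch on a toy two-state POMDP. Everything in it is an illustrative assumption and not taken from the paper: the model (a hidden bit that occasionally flips, observed through a noisy channel), the reward (1 when the action matches the hidden bit), and the names `FiniteMemoryStrategy`, `average_reward` and `last_observation`. It only shows what "finite memory" means operationally: the decision maker keeps a state from a finite set, updates it after each observation, and picks the action from that state alone. The empirical average over finitely many steps stands in for the long-run average objective.

```python
import random
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class FiniteMemoryStrategy:
    """A strategy driven by a finite memory state (hypothetical helper type)."""
    initial_memory: int
    choose_action: Callable[[int], int]        # memory -> action
    update_memory: Callable[[int, int], int]   # (memory, observation) -> memory


def average_reward(strategy, n_steps, stay_prob=0.9, obs_accuracy=0.8, seed=0):
    """Empirical average reward over n_steps in the toy POMDP (assumed model)."""
    rng = random.Random(seed)
    state = 0                         # hidden state, never shown to the strategy
    memory = strategy.initial_memory
    total = 0.0
    for _ in range(n_steps):
        action = strategy.choose_action(memory)
        total += 1.0 if action == state else 0.0
        # The hidden bit flips with probability 1 - stay_prob.
        if rng.random() >= stay_prob:
            state = 1 - state
        # The observation reports the new state correctly with prob. obs_accuracy.
        obs = state if rng.random() < obs_accuracy else 1 - state
        memory = strategy.update_memory(memory, obs)
    return total / n_steps


# Two memory states: remember the last observation and play it back.
last_observation = FiniteMemoryStrategy(
    initial_memory=0,
    choose_action=lambda m: m,
    update_memory=lambda m, o: o,
)

if __name__ == "__main__":
    print(average_reward(last_observation, n_steps=100_000))
```

A strategy with more memory states (for example, one that counts recent observations) fits the same interface; the paper's result concerns the existence of approximately optimal strategies of this finite-memory kind, not this particular toy strategy.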

Cited by 5 publications (5 citation statements)
References 55 publications
“…The key point of the proof is the existence of a pure strategy with finite memory that is ε-optimal in the problem with lim inf evaluation, proved in [7]. As highlighted before, the lim inf value v_∞(x_1) has been shown to be equal to v*(x_1).…”
Section: The Decision-maker Guarantees At Least v*(x_1) (mentioning)
confidence: 85%
“…The supremum can be taken over pure strategies as a direct consequence of Theorem 5.2 in Feinberg [10]. This value coincides with the asymptotic value v*(x_1) [19], and for all ε > 0, there exists an ε-optimal pure strategy with finite memory [7].…”
Section: Limsup Evaluations (mentioning)
confidence: 90%