2017
DOI: 10.1287/moor.2016.0832
|View full text |Cite
|
Sign up to set email alerts
|

On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces

Abstract: Calculating optimal policies is known to be computationally difficult for Markov decision processes (MDPs) with Borel state and action spaces. This paper studies finite-state approximations of discrete time Markov decision processes with Borel state and action spaces, for both discounted and average costs criteria. The stationary policies thus obtained are shown to approximate the optimal stationary policy with arbitrary precision under quite general conditions for discounted cost and more restrictive conditio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
65
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2
2

Relationship

3
6

Authors

Journals

citations
Cited by 43 publications
(65 citation statements)
references
References 45 publications
0
65
0
Order By: Relevance
“…Proof. The result follows from (16) and (17), the fact that W n and W are continuous at k, and Theorem 5. It can be done similar to the proof of Theorem 2, and so, we omit the details.…”
Section: Asymptotic Approximation Of Average Cost Problemsmentioning
confidence: 72%
“…Proof. The result follows from (16) and (17), the fact that W n and W are continuous at k, and Theorem 5. It can be done similar to the proof of Theorem 2, and so, we omit the details.…”
Section: Asymptotic Approximation Of Average Cost Problemsmentioning
confidence: 72%
“…Under the assumption that the transition probability is weakly continuous in stateaction variables and setwise continuous in action variable, it was shown that if one uses sufficiently large number of grid points to partition the state space, then the resulting finite state MDP gives a near optimal policy. In [13] (i.e., full paper version of this work) a rate of convergence analysis and numerical results are also presented.…”
Section: Discussionmentioning
confidence: 99%
“…Even though in this paper we are only concerned with the asymptotic optimality of finite models, in the full paper [13], we consider also the rates of convergence; that is, we quantify how the size of the approximate finite model is related to the approximation error.…”
Section: Introductionmentioning
confidence: 99%
“…However, it still requires the Lipschitz continuity of some components of dynamic programs. Unlike the aforementioned approaches, the finite-state and finite-action approximation method for MDPs with σ-compact state spaces proposed by Saldi et al does not rely on Lipschitz-type continuity conditions [15,16].…”
Section: Related Workmentioning
confidence: 99%