2012
DOI: 10.1017/s0001867800005875
|View full text |Cite
|
Sign up to set email alerts
|

The Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic Approach

Abstract: This paper deals with discrete-time Markov decision processes (MDPs) under constraints where all the objectives have the same form of expected total cost over the infinite time horizon. The existence of an optimal control policy is discussed by using the convex analytic approach. We work under the assumptions that the state and action spaces are general Borel spaces, and that the model is nonnegative, semicontinuous, and there exists an admissible solution with finite cost for the associated linear program. It… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
65
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 15 publications
(67 citation statements)
references
References 9 publications
2
65
0
Order By: Relevance
“…is called to be of finite value. It can be shown as in [9,Thm.3 Appendix 4] or [12]. It can be shown as in the proof of [9, Thm.3.3, Cor.3.1] that the stationary strategy π µ defined by…”
Section: Mdp Approach and The First Linear Programming Methodsmentioning
confidence: 96%
See 4 more Smart Citations
“…is called to be of finite value. It can be shown as in [9,Thm.3 Appendix 4] or [12]. It can be shown as in the proof of [9, Thm.3.3, Cor.3.1] that the stationary strategy π µ defined by…”
Section: Mdp Approach and The First Linear Programming Methodsmentioning
confidence: 96%
“…According to [9,Thm.4.1], for a constrained total cost MDP with Borel state space X ∆ , Borel action space B, transition probability Q, and positive cost functions { C j } J j=0 , if the model is semicontinuous, then, provided that there exists a feasible strategy with finite value, there is an optimal stationary strategy. Here the model is called semicontinuous if its action space B is compact, { C j } J j=0 are all lower semicontinuous, and Q is continuous, i.e., for each bounded continuous function f on X ∆ ,…”
Section: Mdp Approach and The First Linear Programming Methodsmentioning
confidence: 99%
See 3 more Smart Citations