2013
DOI: 10.1239/aap/1370870127
|View full text |Cite
|
Sign up to set email alerts
|

Absorbing Continuous-Time Markov Decision Processes with Total Cost Criteria

Abstract: In this paper we study absorbing continuous-time Markov decision processes in Polish state spaces with unbounded transition and cost rates, and history-dependent policies. The performance measure is the expected total undiscounted costs. For the unconstrained problem, we show the existence of a deterministic stationary optimal policy, whereas, for the constrained problems with N constraints, we show the existence of a mixed stationary optimal policy, where the mixture is over no more than N+1 deterministic sta… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 8 publications
(16 citation statements)
references
References 34 publications
(68 reference statements)
0
16
0
Order By: Relevance
“…This work is organized as follows: To focus on the development of compactification method in [17,21] for the optimal control problem from the setting of diffusion processes to that of CTMDPs, we consider in Section 2 the optimal control problem for classical CTMDPs without any random impact of the environment. The class of ψ-relaxed controls is a subset of classical admissible controls studied, for instance, in [12,13,15]. Therefore, our existence result of optimal ψ-relaxed control is a little stronger than the existence of classical admissible controls.…”
Section: Introductionmentioning
confidence: 89%
See 1 more Smart Citation
“…This work is organized as follows: To focus on the development of compactification method in [17,21] for the optimal control problem from the setting of diffusion processes to that of CTMDPs, we consider in Section 2 the optimal control problem for classical CTMDPs without any random impact of the environment. The class of ψ-relaxed controls is a subset of classical admissible controls studied, for instance, in [12,13,15]. Therefore, our existence result of optimal ψ-relaxed control is a little stronger than the existence of classical admissible controls.…”
Section: Introductionmentioning
confidence: 89%
“…Continuous-time Markov decision processes (CTMDPs) have been extensively studied and widely applied in various application fields such as telecommunication, queueing systems, population processes, epidemiology, and so on. See, for instance, the monographs [12,26], the works [10,11,13,14,15,19,24,25] and references therein. As an illustrative example, we consider the controlled queueing systems.…”
Section: Introductionmentioning
confidence: 99%
“…On the other hand, it follows from Assumptions 3.1(ii) and 3.1(iii) that sup μ∈D μ(K) = sup η∈D K w dη < ∞. Thus, by [11,Lemma 7],D is relatively compact in (M + (K), σ (M + (K))).…”
Section: Q} and The Continuity Ofmentioning
confidence: 95%
“…(i) The occupation measures are in fact state-action frequencies. They are widely used in MDPs so as to transform a stochastic dynamic control problem to a static optimization problem; see [9], [11], [13], [15], [18], and [19]. In the next theorem we state some characterizations of the elements of D. To prove the second assertion of (ii), suppose that we have…”
Section: Occupation Measuresmentioning
confidence: 99%
See 1 more Smart Citation