2020
DOI: 10.48550/arxiv.2003.01342
Preprint

Sufficiency of Markov Policies for Continuous-Time Jump Markov Decision Processes

Abstract: This paper extends to Continuous-Time Jump Markov Decision Processes (CTJMDP) the classic result for Markov Decision Processes stating that, for a given initial state distribution, for every policy there is a (randomized) Markov policy, which can be defined in a natural way, such that at each time instance the marginal distributions of state-action pairs for these two policies coincide. It is shown in this paper that this equality takes place for a CTJMDP if the corresponding Markov policy defines a nonexplosi…
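The classic discrete-time result that the abstract refers to can be checked numerically on a toy model. The sketch below is an illustration only and is not the paper's continuous-time construction: the two-state, two-action model, the transition table `P`, the initial distribution `GAMMA`, and the history-dependent rule `history_policy` are all hypothetical. It builds the Markov policy by conditioning, at each step, the action on the current state under the original policy, and then compares the state-action marginals of the two policies.

```python
from itertools import product

STATES = (0, 1)
ACTIONS = (0, 1)
HORIZON = 4

# Hypothetical transition probabilities: P[x][a][y] = Prob(next state y | state x, action a).
P = {
    0: {0: (0.9, 0.1), 1: (0.4, 0.6)},
    1: {0: (0.3, 0.7), 1: (0.8, 0.2)},
}
GAMMA = (0.5, 0.5)  # hypothetical initial state distribution


def history_policy(history, x):
    """Hypothetical non-Markov policy: action probabilities depend on past actions."""
    ones = sum(a for _, a in history)
    p1 = (1 + ones) / (2 + len(history))  # always strictly between 0 and 1
    if x == 1:
        p1 = 1 - p1
    return (1 - p1, p1)


def marginals_history_policy():
    """Exact state-action marginals at each step, by enumerating all histories."""
    marg = [dict.fromkeys(product(STATES, ACTIONS), 0.0) for _ in range(HORIZON)]
    frontier = [((), x, GAMMA[x]) for x in STATES]  # (history, current state, probability)
    for t in range(HORIZON):
        nxt = []
        for history, x, prob in frontier:
            action_probs = history_policy(history, x)
            for a in ACTIONS:
                p_xa = prob * action_probs[a]
                marg[t][(x, a)] += p_xa
                for y in STATES:
                    nxt.append((history + ((x, a),), y, p_xa * P[x][a][y]))
        frontier = nxt
    return marg


def markov_policy_from(marg):
    """phi[t][x][a] = Prob(action a at step t | state x at step t) under the original policy."""
    phi = []
    for t in range(HORIZON):
        phi_t = {}
        for x in STATES:
            p_x = sum(marg[t][(x, a)] for a in ACTIONS)
            # If x is unreachable at step t, any distribution works; use uniform.
            phi_t[x] = tuple(
                marg[t][(x, a)] / p_x if p_x > 0 else 1 / len(ACTIONS) for a in ACTIONS
            )
        phi.append(phi_t)
    return phi


def marginals_markov_policy(phi):
    """State-action marginals under the Markov policy, by forward recursion."""
    marg = []
    state_dist = list(GAMMA)
    for t in range(HORIZON):
        m = {(x, a): state_dist[x] * phi[t][x][a] for x in STATES for a in ACTIONS}
        marg.append(m)
        state_dist = [
            sum(m[(x, a)] * P[x][a][y] for x in STATES for a in ACTIONS) for y in STATES
        ]
    return marg


m_pi = marginals_history_policy()
phi = markov_policy_from(m_pi)
m_phi = marginals_markov_policy(phi)
max_gap = max(abs(m_pi[t][k] - m_phi[t][k]) for t in range(HORIZON) for k in m_pi[t])
print("largest difference between marginals:", max_gap)
```

In this finite, discrete-time setting the two sets of marginals agree up to floating-point rounding. The abstract's point is that the continuous-time analogue needs an additional nonexplosiveness condition on the Markov policy, which this toy example does not model.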

Cited by 1 publication (5 citation statements)
References 25 publications
“…In this case it is also true that V_α(γ, ϕ) ≤ V_α(γ, ϕ) for a Markov policy ϕ satisfying (4.6), and the equality takes place if P^ϕ_γ(t, X) = 1 for all t ∈ R_+; see [10, Theorem 5]. As shown in [10, Example 1], this may not be true if the cost function C also depends on an action chosen at jump epochs.…”
Section: Applications Of Theorem 43 To Particular Objective Criteria; citation type: mentioning
confidence: 99%
“…As shown in [10, Example 1], this may not be true if the cost function C also depends on an action chosen at jump epochs. Other criteria and additional results can be found in [10, Section 6].…”
Section: Applications Of Theorem 43 To Particular Objective Criteria; citation type: mentioning
confidence: 99%