2020
DOI: 10.1007/s00186-020-00706-w
|View full text |Cite
|
Sign up to set email alerts
|

First-order sensitivity of the optimal value in a Markov decision model with respect to deviations in the transition probability function

Abstract: Markov decision models (MDM) used in practical applications are most often less complex than the underlying 'true' MDM. The reduction of model complexity is performed for several reasons. However, it is obviously of interest to know what kind of model reduction is reasonable (in regard to the optimal value) and what kind is not. In this article we propose a way how to address this question. We introduce a sort of derivative of the optimal value as a function of the transition probabilities, which can be used t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 38 publications
(99 reference statements)
0
3
0
Order By: Relevance
“…The largest generator of d M is called the maximal generator of d M and will be denoted by [35,Definition 3.1]. Examples for d M and M are discussed in Kern et al [26] and Müller [34,35]. Now denote by P ψ the set of all transition functions P = (P n ) N −1 n=0 ∈ P with P n ((x, a), • ) ∈ M ψ 1 (E) for all (x, a) ∈ D n and n = 0, .…”
Section: Continuous Dependence Of the Optimal Value On The Transition...mentioning
confidence: 99%
See 2 more Smart Citations
“…The largest generator of d M is called the maximal generator of d M and will be denoted by [35,Definition 3.1]. Examples for d M and M are discussed in Kern et al [26] and Müller [34,35]. Now denote by P ψ the set of all transition functions P = (P n ) N −1 n=0 ∈ P with P n ((x, a), • ) ∈ M ψ 1 (E) for all (x, a) ∈ D n and n = 0, .…”
Section: Continuous Dependence Of the Optimal Value On The Transition...mentioning
confidence: 99%
“…where we set inf ∅ := ∞. Examples for M and M are discussed in Kern et al [26] and Müller [34]. In the following result, we assume that ψ is a bounding function for any Q ∈ P ψ .…”
Section: Continuous Dependence Of the Optimal Value On The Transition...mentioning
confidence: 99%
See 1 more Smart Citation