2014
DOI: 10.1287/opre.2014.1281
|View full text |Cite
|
Sign up to set email alerts
|

Markov Decision Problems Where Means Bound Variances

Abstract: We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of the optimal total reward can be bounded by a simple linear function of its expected value. The class is characterized by three natural properties: reward non-negativity and boundedness, existence of a do-nothing action, and optimal action monotonicity. These properties are commonly present and typically easy to check. Implications of the class properties and of the variance bound are illustrated by examples of … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
6
1

Relationship

3
4

Authors

Journals

citations
Cited by 13 publications
(8 citation statements)
references
References 42 publications
0
8
0
Order By: Relevance
“…For a particular class of problems, relevant in the study of Markov decision processes where means bound variances (Arlotto et al 2014), Proposition 4 automatically implies full exploration of every post-decision state, as stated in the following corollary.…”
Section: Asymptotic Analysis Of Offline Kgmentioning
confidence: 96%
“…For a particular class of problems, relevant in the study of Markov decision processes where means bound variances (Arlotto et al 2014), Proposition 4 automatically implies full exploration of every post-decision state, as stated in the following corollary.…”
Section: Asymptotic Analysis Of Offline Kgmentioning
confidence: 96%
“…This condition does not fully escape the Saint Petersburg Paradox, but it does give one some practical guidance. Moreover, it was found in Arlotto et al (2014) that such closeness in probability holds automatically for a large class of Markov decision problems.…”
Section: Weighing Costs With More Than Meansmentioning
confidence: 99%
“…Papastavrou et al [12]), only little is known about the limiting distribution of the optimal number of size-focused knapsack selections. From Arlotto et al [2] one has a suggestive upper bound on the variance, but at this point one does not know for sure that this bound gives the principle term of the variance for large n.…”
Section: Observations Connections and Problemsmentioning
confidence: 99%