2009
DOI: 10.1017/s0021900200005520
|View full text |Cite
|
Sign up to set email alerts
|

Blackwell Optimality for Controlled Diffusion Processes

Abstract: In this paper we study m-discount optimality (m ≥ −1) and Blackwell optimality for a general class of controlled (Markov) diffusion processes. To this end, a key step is to express the expected discounted reward function as a Laurent series, and then search certain control policies that lexicographically maximize the mth coefficient of this series for m = −1, 0, 1, . . . . This approach naturally leads to m-discount optimality and it gives Blackwell optimality in the limit as m → ∞.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 29 publications
0
3
0
Order By: Relevance
“…This is an extension of the example presented in [30], which in turn was motivated by the manufacturing system in [31]. A variant of this system was also studied in [18,19].…”
Section: An Examplementioning
confidence: 91%
See 1 more Smart Citation
“…This is an extension of the example presented in [30], which in turn was motivated by the manufacturing system in [31]. A variant of this system was also studied in [18,19].…”
Section: An Examplementioning
confidence: 91%
“…To obtain (33), first note that, by (30) and (6), h π 1 ,π 2 is indeed μ π 1 ,π 2 -integrable for every (π 1 , π 2 ) ∈ Π 1 × Π 2 . Then in (33) choose the distribution of the initial state to be μ π 1 ,π 2 and so (33) follows from Fubini's theorem and the invariance of μ π 1 ,π 2 .…”
Section: Proposition 51 For Eachmentioning
confidence: 99%
“…In fact, the EAR criterion does not distinguish two different average optimal policies that have different finite-horizon rewards. To avoid this situation, many authors consider more sensitive criteria such as (i) the variance-minimization criterion (see, for instance [2,9,10] and their references); (ii) overtaking optimality criteria, such as Flynn's 'opportunity cost', Dutta's criterion, and bias optimality, etc (see [2,[11][12][13][14][15] for instance); and (iii) 'discount-sensitive' criteria (see [14,[16][17][18][19]). …”
Section: Introductionmentioning
confidence: 99%