1992
DOI: 10.1109/9.159584

Perturbation and stability theory for Markov control problems

citations
Cited by 50 publications
(21 citation statements)
references
References 11 publications
“…Obviously, $v^{\pi_s}_P$ is continuous in P [11]. However, P can be discontinuous in θ over Θ, which may lead to discontinuous value functions $v^{\pi_s}_P$ in θ.…”
Section: Introduction; citation type: mentioning
confidence: 94%
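
The continuity claim in the statement above can be illustrated numerically. The following is a minimal sketch, not the cited paper's construction: it assumes a discounted criterion with discount factor 0.9, a two-state chain induced by a fixed stationary policy, and made-up rewards. The names `policy_value` and `perturb` are hypothetical. It evaluates the policy by solving (I - gamma P) v = r and shows the gap between the perturbed and unperturbed values shrinking as the perturbation of P shrinks.

```python
def policy_value(P, r, gamma=0.9):
    """Value of a fixed policy on a 2-state chain: solve (I - gamma*P) v = r.

    Assumption: discounted criterion; the 2x2 system is solved by Cramer's rule.
    """
    a = 1.0 - gamma * P[0][0]
    b = -gamma * P[0][1]
    c = -gamma * P[1][0]
    d = 1.0 - gamma * P[1][1]
    det = a * d - b * c  # nonzero when gamma < 1 and P is row-stochastic
    return [(r[0] * d - b * r[1]) / det, (a * r[1] - c * r[0]) / det]


def perturb(P, eps):
    """Move eps of probability mass in state 0 from 'stay' to 'switch'."""
    return [[P[0][0] - eps, P[0][1] + eps], [P[1][0], P[1][1]]]


# Illustrative (made-up) transition matrix and rewards.
P = [[0.9, 0.1], [0.2, 0.8]]
r = [1.0, 0.0]

base = policy_value(P, r)
gaps = []
for eps in (1e-1, 1e-2, 1e-3):
    v = policy_value(perturb(P, eps), r)
    gaps.append(max(abs(v[i] - base[i]) for i in range(2)))
    print(f"eps={eps:g}  max |v_eps - v| = {gaps[-1]:.6f}")
```

The gap shrinks with the perturbation here, but it is amplified by the inverse of (I - gamma P), so for discount factors close to 1 even a small estimation error in P can move the value noticeably.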
“…Thus, optimal solutions via classic DP are difficult to attain. It is not difficult to imagine that the estimated transition probabilities may be far from the true values due to noise and other errors associated with the estimation process, or the estimation error may be nontrivial such that it results in significant deviations from true optimal solutions [1], [8], [12]. Therefore, the ideas of set estimation for transition matrices with high confidence and robust DP were proposed to alleviate some of the deficits from both inaccurate transition matrix models and point estimation.…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…[12], [13], [14], [15]. In this paper we exploit the structured LP formulation proposed in [9] and ACCPM to provide an efficient algorithm for solving ergodic MDPs with strong and weak interactions.…”
Section: Markov Decision Processes (MDPs) or Their Control Counterpar; citation type: mentioning
confidence: 99%