49th IEEE Conference on Decision and Control (CDC) 2010
DOI: 10.1109/cdc.2010.5717895
|View full text |Cite
|
Sign up to set email alerts
|

Toward an optimized value iteration algorithm for average cost Markov decision processes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
10
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(10 citation statements)
references
References 9 publications
0
10
0
Order By: Relevance
“…Indeed, when a suitable decreasing rate is found, it can result in significant computational savings. However, a poor choice of decreasing may result in an inefficient algorithm, which can even be outperformed by standard value iteration [14]. In this paper we address this short-coming by introducing an algorithm that adaptively decreases the error sequence k  , and that results in a more robust algorithm, with more stable behavior that consistently outperforms standard value iteration.…”
Section: The Parameter Sequence K mentioning
confidence: 99%
See 4 more Smart Citations
“…Indeed, when a suitable decreasing rate is found, it can result in significant computational savings. However, a poor choice of decreasing may result in an inefficient algorithm, which can even be outperformed by standard value iteration [14]. In this paper we address this short-coming by introducing an algorithm that adaptively decreases the error sequence k  , and that results in a more robust algorithm, with more stable behavior that consistently outperforms standard value iteration.…”
Section: The Parameter Sequence K mentioning
confidence: 99%
“…The unknown rate of convergence renders the results in [13] not directly applicable for the studied problem. Earlier results, however, have shown that significant reduction on the overall computational effort can be attained by a suitable choice of refinement rate [14]. Unfortunately, such rate is now known a priori and the parameter tuning turns out to be very difficult.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations