2009
DOI: 10.1007/978-3-642-02894-6_15
|View full text |Cite
|
Sign up to set email alerts
|

A Rollout Algorithm for Multichain Markov Decision Processes with Average Cost

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…However, the ideas they propose can be incorporated to guide initial exploration of actions in approaches like References [1,52]. • Relaxing certain theoretical assumptions like non-communicating MDPs [23], multi-chain MDPs [67], and so on, can further improve the applicability of regret-based approaches in control-based approaches. • Most of the model-based and model-free approaches in Section 5 are not scalable to large problem sizes.…”
Section: Future Directionsmentioning
confidence: 99%
“…However, the ideas they propose can be incorporated to guide initial exploration of actions in approaches like References [1,52]. • Relaxing certain theoretical assumptions like non-communicating MDPs [23], multi-chain MDPs [67], and so on, can further improve the applicability of regret-based approaches in control-based approaches. • Most of the model-based and model-free approaches in Section 5 are not scalable to large problem sizes.…”
Section: Future Directionsmentioning
confidence: 99%
“…However, the ideas they propose can be incorporated to guide initial exploration of actions in approaches like [29], [30]. • Relaxing certain theoretical assumptions like noncommunicating MDPs [72], multi-chain MDPs [73] etc can further improve the applicability of regret-based approaches in control-based approaches.…”
Section: Future Directionsmentioning
confidence: 99%
“…Therefore, any prospective methodology must incorporate such a limitation in its solution process. We incorporate the Optimal Computing Budget Allocation (OCBA) algorithm into our MDP solution process [2], [3] to address the limited simulation budget problem.…”
Section: Introductionmentioning
confidence: 99%