2019 IEEE 58th Conference on Decision and Control (CDC) 2019
DOI: 10.1109/cdc40024.2019.9029885
|View full text |Cite
|
Sign up to set email alerts
|

Solving Ergodic Markov Decision Processes and Perfect Information Zero-sum Stochastic Games by Variance Reduced Deflated Value Iteration

Abstract: Recently, Sidford, Wang, Wu and Ye (2018) developed an algorithm combining variance reduction techniques with value iteration to solve discounted Markov decision processes. This algorithm has a sublinear complexity when the discount factor is fixed. Here, we extend this approach to meanpayoff problems, including both Markov decision processes and perfect information zero-sum stochastic games. We obtain sublinear complexity bounds, assuming there is a distinguished state which is accessible from all initial st… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 13 publications
0
1
0
Order By: Relevance
“…1 and Th .2], using tools from non-linear Perron-Frobenius theory [AGN11]. In particular, we refer to [AGQS19] for background on weighted sup-norms.…”
Section: The Different Convergence Phasesmentioning
confidence: 99%
“…1 and Th .2], using tools from non-linear Perron-Frobenius theory [AGN11]. In particular, we refer to [AGQS19] for background on weighted sup-norms.…”
Section: The Different Convergence Phasesmentioning
confidence: 99%