2018
DOI: 10.1007/978-3-319-96145-3_36
|View full text |Cite
|
Sign up to set email alerts
|

Value Iteration for Simple Stochastic Games: Stopping Criterion and Learning Algorithm

Abstract: Simple stochastic games can be solved by value iteration (VI), which yields a sequence of under-approximations of the value of the game. This sequence is guaranteed to converge to the value only in the limit. Since no stopping criterion is known, this technique does not provide any guarantees on its results. We provide the first stopping criterion for VI on simple stochastic games. It is achieved by additionally computing a convergent sequence of over-approximations of the value, relying on an analysis of the … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
105
2

Year Published

2018
2018
2020
2020

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 35 publications
(107 citation statements)
references
References 31 publications
0
105
2
Order By: Relevance
“…2. The transition from algorithms for MDP to SG is possible via extending the over-approximating value iteration from MDP [BCC+14] to SG by [KKKW18].…”
Section: The Increased Practical Performance Rests On Two Pillarsmentioning
confidence: 99%
See 4 more Smart Citations
“…2. The transition from algorithms for MDP to SG is possible via extending the over-approximating value iteration from MDP [BCC+14] to SG by [KKKW18].…”
Section: The Increased Practical Performance Rests On Two Pillarsmentioning
confidence: 99%
“…[WT16] combines the special case of almost-sure satisfaction of a specification with optimizing quantitative objectives. We use techniques of [KKKW18], which however assumes access to the transition probabilities.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations