2008
DOI: 10.1016/j.ic.2007.09.002
|View full text |Cite
|
Sign up to set email alerts
|

Reachability in recursive Markov decision processes

Abstract: We consider a class of infinite-state Markov decision processes generated by stateless pushdown automata. This class corresponds to 1 1 2player games over graphs generated by BPA systems or (equivalently) 1-exit recursive state machines. An extended reachability objective is specified by two sets S and T of safe and terminal stack configurations, where the membership to S and T depends just on the top-of-the-stack symbol. The question is whether there is a suitable strategy such that the probability of hitting… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
42
0

Year Published

2008
2008
2018
2018

Publication Types

Select...
5
3
1

Relationship

4
5

Authors

Journals

citations
Cited by 29 publications
(43 citation statements)
references
References 11 publications
1
42
0
Order By: Relevance
“…We say that a strategy is static if for each type T i controlled by that player the strategy always chooses the same action a i , or the same probability distribution on actions, for all entities of type T i in all histories. 4 Our notion of an arbitrary strategy is quite general (it can depend on all the details of the entire history, and be randomized, etc.). However, it was shown in [14] that for the objective of optimizing extinction probability, both players have optimal static strategies in BSSGs.…”
Section: Definitions and Backgroundmentioning
confidence: 99%
“…We say that a strategy is static if for each type T i controlled by that player the strategy always chooses the same action a i , or the same probability distribution on actions, for all entities of type T i in all histories. 4 Our notion of an arbitrary strategy is quite general (it can depend on all the details of the entire history, and be randomized, etc.). However, it was shown in [14] that for the objective of optimizing extinction probability, both players have optimal static strategies in BSSGs.…”
Section: Definitions and Backgroundmentioning
confidence: 99%
“…Also observe that A.2 does not hold for infinite-state MDPs (a counterexample is simple to construct even for a single reachability objective, see e.g. [5,Example 6]). …”
Section: Expectation Objectivesmentioning
confidence: 99%
“…The result about termination has been extended to general qualitative reachability in [13], even for a more general model of Markov decision processes generated by BPA. Thus, we obtain the following:…”
Section:   mentioning
confidence: 99%