2021
DOI: 10.1016/j.jcss.2021.02.006
|View full text |Cite
|
Sign up to set email alerts
|

The complexity of reachability in parametric Markov decision processes

Abstract: This paper studies parametric Markov decision processes (pMDPs), an extension to Markov decision processes (MDPs) where transitions probabilities are described by polynomials over a finite set of parameters. Fixing values for all parameters yields MDPs. In particular, this paper studies the complexity of finding values for these parameters such that the induced MDP satisfies some reachability constraints. We discuss different variants depending on the comparison operator in the constraints and the domain of th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

5
3

Authors

Journals

citations
Cited by 18 publications
(10 citation statements)
references
References 44 publications
0
10
0
Order By: Relevance
“…Optimization-based approaches use branch-and-bound (Meuleau et al, 1999) or segmentation into reachable sets (Sharan & Burdick, 2014) to limit the searchable space. Junges et al (2018) construct an FSC using parameter synthesis for Markov chains, which is known to be ETR-complete (Junges, Katoen, et al, 2021), whereas NP ⊆ ETR ⊆ PSPACE. Carr et al (2018) render common POMDP scenarios as arcade games to capture human preferences that are formally cast into FSCs and subsequently verified.…”
Section: Related Workmentioning
confidence: 99%
“…Optimization-based approaches use branch-and-bound (Meuleau et al, 1999) or segmentation into reachable sets (Sharan & Burdick, 2014) to limit the searchable space. Junges et al (2018) construct an FSC using parameter synthesis for Markov chains, which is known to be ETR-complete (Junges, Katoen, et al, 2021), whereas NP ⊆ ETR ⊆ PSPACE. Carr et al (2018) render common POMDP scenarios as arcade games to capture human preferences that are formally cast into FSCs and subsequently verified.…”
Section: Related Workmentioning
confidence: 99%
“…The reachability function can grow exponentially in the number of parameters, even for acyclic pMCs [1]. Feasibility is a computationally hard problem: finding parameter values for a pMC that satisfy a reachability objective is ETRcomplete [34] 4 . Feasibility has been tackled using sampling search methods such as PSO and Markov Chain Monte Carlo [16] and solving a non-linear optimization problem [4].…”
Section: Computing Reachability Functions Compute the Rational Functi...mentioning
confidence: 99%
“…Typical objectives are of the form: is the probability to reach some states below (or above) a given threshold q? The complexity of various pMC synthesis problems is studied in [1,34].…”
Section: Introductionmentioning
confidence: 99%
“…The challenge in applying parameter synthesis is twofold: whereas the problem is ETR-complete 4 [34], the number of parameters grows linear in the number of different observations and the number of actions available to the controller. For many real-life applications we must thus deal with thousands of parameters.…”
Section: Introductionmentioning
confidence: 99%
“…Technically, we use graph-preserving to ensure continuously differentiability of ER s→ M . For acyclic pMCs, these functions are continuously differentiable without assuming graph-preservation[34].…”
mentioning
confidence: 99%