2010 Seventh International Conference on the Quantitative Evaluation of Systems 2010
DOI: 10.1109/qest.2010.35
|View full text |Cite
|
Sign up to set email alerts
|

Reasoning about MDPs as Transformers of Probability Distributions

Abstract: Abstract-We consider Markov Decision Processes (MDPs) as transformers on probability distributions, where with respect to a scheduler that resolves nondeterminism, the MDP can be seen as exhibiting a behavior that is a sequence of probability distributions. Defining propositions using linear inequalities, one can reason about executions over the space of probability distributions. In this framework, one can analyze properties that cannot be expressed in logics like PCTL * , such as expressing bounds on transie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
55
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 29 publications
(55 citation statements)
references
References 20 publications
0
55
0
Order By: Relevance
“…These specifications are undoubtedly interesting since they are ubiquitous; however there are many interesting and intuitive properties that they can not express, in particular, considerations about the dynamical evolution of state probabilities are either convoluted or impossible to state in these frameworks, as pointed in e.g. [15,13,7,1].…”
Section: Introductionmentioning
confidence: 99%
“…These specifications are undoubtedly interesting since they are ubiquitous; however there are many interesting and intuitive properties that they can not express, in particular, considerations about the dynamical evolution of state probabilities are either convoluted or impossible to state in these frameworks, as pointed in e.g. [15,13,7,1].…”
Section: Introductionmentioning
confidence: 99%
“…On the contrary, in this work, the control objective is to achieve a given steadystate distribution. In a recent line of work [3,6], the authors consider transient properties of MDP viewed as transformers of probability distributions. Compared to that setting, we are interested rather in long run properties.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, an alternative semantics for probabilistic automata has been proposed, with applications in sensor networks, queuing theory, and dynamical systems [9,8,5]. In this new semantics, a run over an input word is the sequence of probability distributions produced by the automaton.…”
Section: Introductionmentioning
confidence: 99%
“…The space of probability distributions (which is a subset of [0,1] n ) is partitioned into regions defined by linear predicates, and classical acceptance conditions are used to define accepting sequences of regions. It is known that reachability of a region is undecidable for linear predicates, and that it becomes decidable for a class of qualitative predicates which essentially constrain only the support of the probability distributions [8].…”
Section: Introductionmentioning
confidence: 99%