Proceedings of the 33rd Annual ACM/IEEE Symposium on Logic in Computer Science 2018
DOI: 10.1145/3209108.3209185
Distribution-based objectives for Markov Decision Processes

Abstract: We consider distribution-based objectives for Markov Decision Processes (MDP). This class of objectives gives rise to an interesting trade-off between full and partial information. As in full observation, the strategy in the MDP can depend on the state of the system; but, as in partial information, the strategy needs to account for all the states at the same time. In this paper, we focus on two safety problems that arise naturally in this context, namely, existential and universal safety. Given an MDP A and…

Cited by 10 publications (11 citation statements) · References 28 publications
“…The global state of the game is a distribution over the local states of the processes, and the specification describes which sequences of distributions are winning. The distributions can be discrete [AAGT12,CFO20] or continuous [KVAK10,AGV18]. The control may be applied uniformly, independently of the local state of each process, as in non-deterministic [BDGG17] and probabilistic automata [CFO20], or it may depend on the local history of states, as in Markov decision processes (MDPs) [AGV18,DMS19].…”
Section: Introduction (mentioning)
confidence: 99%
“…The distributions can be discrete [AAGT12,CFO20] or continuous [KVAK10,AGV18]. The control may be applied uniformly, independently of the local state of each process, as in non-deterministic [BDGG17] and probabilistic automata [CFO20], or it may depend on the local history of states, as in Markov decision processes (MDPs) [AGV18,DMS19]. In both cases imperfect information arises: either because the control is global, and thus not aware of the local state of individual processes, or because the control is local, and thus not aware of the global states on which the specification is defined.…”
Section: Introduction (mentioning)
confidence: 99%
“…In the more recent distribution-based semantics, the outcome of a stochastic process is a sequence of distributions over states [3,19]. This alternative semantics has received some attention recently for theoretical analysis of probabilistic bisimulation [17] and is adequate to describe large populations of agents [14,10] with applications in system biology [19,1]. The behaviour of an agent is modeled as an MDP with some state space Q, and a large population of identical agents is described by a (continuous) distribution d : Q → [0, 1] that gives the fraction d(q) of agents in the population that are in each state q ∈ Q.…”
Section: Introduction (mentioning)
confidence: 99%
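The population semantics described above can be sketched in a few lines. This is a minimal illustration, not code from the cited papers: the state space, transition kernel, and two-state example MDP below are invented for the sketch. A memoryless state-based strategy picks an action per state, and the population distribution d over states evolves by d'(q') = Σ_q d(q) · P[q][σ(q)][q'].

```python
def step(d, strategy, P):
    """Advance a population distribution d (state -> fraction) by one step.

    strategy maps each state to an action; P[q][a] is the distribution
    over successor states after playing action a in state q.
    """
    d_next = {}
    for q, mass in d.items():
        for q2, p in P[q][strategy[q]].items():
            d_next[q2] = d_next.get(q2, 0.0) + mass * p
    return d_next

# Hypothetical two-state MDP: from 'a', action 'go' splits the
# population evenly between 'a' and 'b'; 'b' is absorbing.
P = {
    "a": {"go": {"a": 0.5, "b": 0.5}},
    "b": {"stay": {"b": 1.0}},
}
strategy = {"a": "go", "b": "stay"}

d0 = {"a": 1.0}               # entire population starts in state 'a'
d1 = step(d0, strategy, P)    # {'a': 0.5, 'b': 0.5}
```

A distribution-based safety objective in this setting would then constrain the whole sequence d0, d1, d2, … (e.g. keep the fraction of agents in a bad state below a threshold at every step), rather than the probability of any single agent's trajectory.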