2013
DOI: 10.1007/978-3-642-36046-6_11
|View full text |Cite
|
Sign up to set email alerts
|

Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games

Abstract: Markov decision processes (MDPs) and simple stochastic games (SSGs) provide a rich mathematical framework to study many important problems related to probabilistic systems. MDPs and SSGs with finite-horizon objectives, where the goal is to maximize the probability to reach a target state in a given finite time, is a classical and well-studied problem. In this work we consider the strategy complexity of finite-horizon MDPs and SSGs. We show that for all ǫ > 0, the natural class of counter-based strategies requi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 7 publications
0
3
0
Order By: Relevance
“…The underpopulation opinion dynamics considered in [14] is a noticeable example of a (family of) local threshold-based opinion dynamics, which corresponds to having, for any u ∈ V, θ + (u) = c + and θ − (u) = c − for some pair of constants c + , c − ∈ N. The underpopulation dynamics ruled by the constants c + and c − will be denoted as…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The underpopulation opinion dynamics considered in [14] is a noticeable example of a (family of) local threshold-based opinion dynamics, which corresponds to having, for any u ∈ V, θ + (u) = c + and θ − (u) = c − for some pair of constants c + , c − ∈ N. The underpopulation dynamics ruled by the constants c + and c − will be denoted as…”
Section: Resultsmentioning
confidence: 99%
“…Goles and Olivos [13] proposed single-threshold deterministic opinion dynamics for weighted graphs in which every node at each step is either in state 0 or in state 1, and at the next step, it obtains state 1 if and only if the sum of the products between the current state of each of its k neighbors with the weight of the edge connecting the neighbor to the individual is at least θ(k) for a given function θ defined on the node set. In [14], a couple of deterministic opinion dynamics models, the overpopulation and theunderpopulation rules, that still work with binary node states are defined for unsigned graphs as a simplification of the Game-of-Life rules [15]; according to the underpopulation rule, nodes' opinion changes are controlled by two constant values, c + and c − . A node gets a positive opinion if either its opinion is already positive and at least c + neighbors have a positive opinion, or its opinion is negative and at least c − neighbors have a positive opinion.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation