Solving Ergodic Markov Decision Processes and Perfect Information Zero-sum Stochastic Games by Variance Reduced Deflated Value Iteration

Akian, Marianne; Gaubert, Stéphane; Qu, Zheng; Saadi, Omar

doi:10.1109/cdc40024.2019.9029885

Search citation statements

Order By: Relevance

Paper Sections

Select...

The Different Convergence Phases1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2022

Publication Types

Select...

Other1

Book1

Relationship

Self Cite1

Independent1

Authors

Journals

Cited by 2 publications

(1 citation statement)

References 13 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…1 and Th .2], using tools from non-linear Perron-Frobenius theory [AGN11]. In particular, we refer to [AGQS19] for background on weighted sup-norms.…”

Section: The Different Convergence Phasesmentioning

confidence: 99%

Computing Transience Bounds of Emergency Call Centers: a Hierarchical Timed Petri Net Approach

Allamigeon¹,

Boyet²,

Gaubert³

2022

Preprint

Self Cite

View full text Add to dashboard Cite

A fundamental issue in the analysis of emergency call centers is to estimate the time needed to return to a congestion-free regime after an unusual event with a massive arrival of calls. Call centers can generally be represented by timed Petri nets with a hierarchical structure, in which several layers describe the successive steps of treatments of calls. We study a continuous approximation of the Petri net dynamics (with infinitesimal tokens). Then, we show that a counter function, measuring the deviation to the stationary regime, coincides with the value function of a semi-Markov decision problem. Then, we establish a finite time convergence result, exploiting the hierarchical structure of the Petri net. We obtain an explicit bound for the transience time, as a function of the initial marking and sojourn times. This is based on methods from the theory of stochastic shortest paths and non-linear Perron-Frobenius theory. We illustrate the bound on a case study of a medical emergency call center.

show abstract

“…1 and Th .2], using tools from non-linear Perron-Frobenius theory [AGN11]. In particular, we refer to [AGQS19] for background on weighted sup-norms.…”

Section: The Different Convergence Phasesmentioning

confidence: 99%