2022
DOI: 10.7494/csci.2022.23.1.4139

Improving modified policy iteration for probabilistic model checking

Abstract: Value iteration, policy iteration, and their modified versions are well-known algorithms for probabilistic model checking of Markov Decision Processes. One challenge of these methods is that they are time-consuming in most cases. Several techniques have been proposed to improve the performance of iterative methods for probabilistic model checking. However, the running time of these techniques depends on the graphical structure of the model, and in some cases their performance is worse than the performance of…

Cited by 3 publications (4 citation statements)
References 43 publications

“…The modified policy iteration [1] method updates the policy after a fixed number of iterations (100 iterations, for example) rather than waiting for the convergence criterion to be satisfied. It usually yields faster convergence of the value estimates [29,32]. More details of modified policy iteration are available in [1,19,31].…”
Section: Modified Policy Iteration (mentioning)
confidence: 99%
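To make the quoted idea concrete, the following is a minimal sketch of modified policy iteration in Python for a small, discounted-reward MDP. It is not the paper's implementation; the arrays P and R, the discount factor gamma, and the inner-sweep count k are illustrative assumptions. The key point from the statement above is the inner loop, which performs only a fixed number of evaluation sweeps (k) under the current policy instead of solving the evaluation equations to convergence.

import numpy as np

def modified_policy_iteration(P, R, gamma=0.95, k=100, max_outer=1000, tol=1e-8):
    # Sketch of modified policy iteration (assumed setup, not the paper's code).
    # P: (A, S, S) transition probabilities, R: (S, A) immediate rewards,
    # k: fixed number of inner evaluation sweeps before each policy update.
    A, S, _ = P.shape
    V = np.zeros(S)
    policy = np.zeros(S, dtype=int)
    for _ in range(max_outer):
        # Partial policy evaluation: only k sweeps under the current policy,
        # instead of solving the linear system exactly.
        for _ in range(k):
            V_new = np.array([R[s, policy[s]] + gamma * P[policy[s], s] @ V
                              for s in range(S)])
            if np.max(np.abs(V_new - V)) < tol:
                V = V_new
                break
            V = V_new
        # Policy improvement: greedy step over all actions.
        Q = R.T + gamma * (P @ V)          # shape (A, S)
        new_policy = Q.argmax(axis=0)
        if np.array_equal(new_policy, policy):
            return V, policy
        policy = new_policy
    return V, policy

With k = 1 this degenerates to value iteration, and with k large enough to reach convergence it behaves like standard policy iteration, which is why the choice of k matters for running time.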
“…Although modified policy iteration is usually faster than standard policy iteration (Algorithm 1), the number of iterations of the Gauss-Seidel method influences its running time. In [29], a dynamic method has been proposed for determining the number of iterations for each quotient DTMC.…”
Section: Modified Policy Iteration (mentioning)
confidence: 99%
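The role of the Gauss-Seidel inner solver mentioned above can be sketched as follows; again this is a hypothetical illustration, not the dynamic method of [29]. Here P_pi and R_pi denote the transition matrix and reward vector induced by a fixed policy, and k bounds the number of sweeps. Because each state update immediately reuses the newest values of its successors, Gauss-Seidel sweeps typically converge in fewer iterations than Jacobi-style sweeps, and the choice of k (fixed or determined dynamically) directly affects the running time.

import numpy as np

def gauss_seidel_evaluation(P_pi, R_pi, V0, gamma=0.95, k=100, tol=1e-8):
    # Up to k Gauss-Seidel sweeps of policy evaluation for a fixed policy.
    # P_pi: (S, S) transition matrix under the policy, R_pi: (S,) rewards.
    V = V0.copy()
    S = len(V)
    for _ in range(k):
        delta = 0.0
        for s in range(S):
            # In-place update: states already updated in this sweep
            # contribute their newest values to their predecessors.
            new_v = R_pi[s] + gamma * P_pi[s] @ V
            delta = max(delta, abs(new_v - V[s]))
            V[s] = new_v
        if delta < tol:  # early exit; a dynamic scheme could instead adapt k
            break
    return V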