2015
DOI: 10.1007/s11425-015-5029-x
|View full text |Cite
|
Sign up to set email alerts
|

A minimization problem of the risk probability in first passage semi-Markov decision processes with loss rates

Abstract: This paper is the first attempt to investigate the risk probability criterion in semi-Markov decision processes with loss rates. The goal is to find an optimal policy with the minimum risk probability that the total loss incurred during a first passage time to some target set exceeds a loss level. First, we establish the optimality equation via a successive approximation technique, and show that the value function is the unique solution to the optimality equation. Second, we give suitable conditions, under whi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
15
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(15 citation statements)
references
References 24 publications
0
15
0
Order By: Relevance
“…It follows from Theorem 1 in [16]. (2) Theorem 3.3 is also called Lyapunov condition, which is weaker than the well known regular condition for SMDPs in [10,12,13,14]. This is because the regular condition for SMDPs means that Q(δ, E | i, a) ≤ 1 − ε, for all (i, a) ∈ K (for some δ and > 0), where Q(δ, E | i, a) is the semi-Markov kernel.…”
Section: Theorem 33mentioning
confidence: 99%
See 4 more Smart Citations
“…It follows from Theorem 1 in [16]. (2) Theorem 3.3 is also called Lyapunov condition, which is weaker than the well known regular condition for SMDPs in [10,12,13,14]. This is because the regular condition for SMDPs means that Q(δ, E | i, a) ≤ 1 − ε, for all (i, a) ∈ K (for some δ and > 0), where Q(δ, E | i, a) is the semi-Markov kernel.…”
Section: Theorem 33mentioning
confidence: 99%
“…(ii) The other is first passage risk probability criterion, which usually refers to the probability of the total rewards does not exceed a reward level (profit goal) during a first passage time that the state process first enters a target set. This paper belongs to the second group for MDPs, which have DOI: 10.14736/kyb-2019-1-0114 been discussed in [10,14,18,21,20,27]. More precisely, [21] consider risk minimizing problems in discrete time Markov decisions processes (DTMDPs) with a target set.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations