2002
DOI: 10.1081/stm-120002780
|View full text |Cite
|
Sign up to set email alerts
|

Non-ergodic Markov decision processes with a constraint on the asymptotic failure rate: general class of policies

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2004
2004
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 2 publications
0
3
0
Order By: Relevance
“…Both of these reinforcement learning methods assume that every Markov chain induced by a policy is irreducible, which allows only a single recurrent class as with ergodic and unichain assumptions described earlier. The Lagrangian approach has also been applied to specific stochastic policy linear programming formulations relevant to aircraft maintenance problems where the asymptotic failure is to be kept below some small threshold (Boussemart & Limnios, 2004;Boussemart et al, 2002).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Both of these reinforcement learning methods assume that every Markov chain induced by a policy is irreducible, which allows only a single recurrent class as with ergodic and unichain assumptions described earlier. The Lagrangian approach has also been applied to specific stochastic policy linear programming formulations relevant to aircraft maintenance problems where the asymptotic failure is to be kept below some small threshold (Boussemart & Limnios, 2004;Boussemart et al, 2002).…”
Section: Related Workmentioning
confidence: 99%
“…Steady-state planning has applications in several areas, such as deriving maintenance plans for various systems, including aircraft maintenance, where the asymptotic failure rate of components must be kept below some small threshold (Boussemart & Limnios, 2004;Boussemart, Limnios, & Fillion, 2002). Optimal routing problems for communication networks have also been proposed in which data throughput must be maximized subject to constraints on average delay and packet drop metrics (Lazar, 1983).…”
Section: Introductionmentioning
confidence: 99%
“…A high level of local instability of the frequency causes a high probability of early/late deliveries that are both undesirable (early deliveries lead to wasting the items that are not consumed before expiration, and late deliveries lead to a shortage of items). • dependability, i.e., an upper bound on failure frequency (see, e.g., (Boussemart and Limnios 2004;Boussemart, Limnios, and Fillion 2002)). If this bound is locally violated with considerable probability, a user may interpret this as a violation of the dependability guarantee.…”
Section: Introductionmentioning
confidence: 99%