2021
DOI: 10.1017/apr.2020.64
|View full text |Cite
|
Sign up to set email alerts
|

On gradual-impulse control of continuous-time Markov decision processes with exponential utility

Abstract: We consider a gradual-impulse control problem of continuous-time Markov decision processes, where the system performance is measured by the expectation of the exponential utility of the total cost. We show, under natural conditions on the system primitives, the existence of a deterministic stationary optimal policy out of a more general class of policies that allow multiple simultaneous impulses, randomized selection of impulses with random effects, and accumulation of jumps. After characterizing the value fun… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…In the future, we will continue to optimize HGA to generate counterexamples for MCPSs against requirements in PCTL* which is a super set of PCTL. At the same time, we would like to extend HGA to generate counterexamples for MCPSs models with continuous-time or hybrid-time stochastic behaviors, such as CTMDPs (Continuoustime Markov Decision Processes), PHA (Probabilistic Hybrid Automata), against CSL (Continuous-time Stochastic Logic) stochastic communication logics [44,45]. Funding: This research was funded by the Singapore-UK Cyber Security of EPSRC under Grant Nos.…”
Section: Discussionmentioning
confidence: 99%
“…In the future, we will continue to optimize HGA to generate counterexamples for MCPSs against requirements in PCTL* which is a super set of PCTL. At the same time, we would like to extend HGA to generate counterexamples for MCPSs models with continuous-time or hybrid-time stochastic behaviors, such as CTMDPs (Continuoustime Markov Decision Processes), PHA (Probabilistic Hybrid Automata), against CSL (Continuous-time Stochastic Logic) stochastic communication logics [44,45]. Funding: This research was funded by the Singapore-UK Cyber Security of EPSRC under Grant Nos.…”
Section: Discussionmentioning
confidence: 99%
“…Pure stationary strategies are the easiest for implementation. They are often sufficient for problems with a single objective, i.e., when J = 0 (see e.g., [9,17]) or when the model satisfies additional structural conditions, such as being convex or atomless (see e.g., [11,14]). The next example demonstrates that without further conditions, pure stationary strategies are often not sufficient for the constrained problem (2.3).…”
mentioning
confidence: 99%