2020
DOI: 10.1080/08982112.2020.1766692
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning for dynamic condition-based maintenance of a system with individually repairable components

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 53 publications
(11 citation statements)
references
References 33 publications
0
11
0
Order By: Relevance
“…Adsule et al [1], modeled the CBM decision-making problem as a continuous semi-Markov decision process (CSMDP), and applied an RL algorithm. Yousefi et al [9], modeled the CBM decision-making problem as an MDP and also used an RL algorithm. Peng et al [10], modeled the problem of CBM as a continuous Markov decision-making process without discretizing the degradation states under a Gaussian process (GP) and then applied an RL algorithm.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…Adsule et al [1], modeled the CBM decision-making problem as a continuous semi-Markov decision process (CSMDP), and applied an RL algorithm. Yousefi et al [9], modeled the CBM decision-making problem as an MDP and also used an RL algorithm. Peng et al [10], modeled the problem of CBM as a continuous Markov decision-making process without discretizing the degradation states under a Gaussian process (GP) and then applied an RL algorithm.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The feedback is usually termed as a reward. The agent's goal (objective) is to maximize cumulative rewards by learning to perform better [7].An MDP usually describes the environment, consisting of a state space, an action space, a reward function, and state transition probabilities.Therefore, MDP for an RL problem has the following components [11,17,9].…”
Section: Introductionmentioning
confidence: 99%
“…For example, a four‐state MDP has been used to model CBM for multi‐component systems with individual reparable components. The authors have used RL to find an optimal maintenance action for each of the components [3].…”
Section: Literature Reviewmentioning
confidence: 99%
“…There have been some research studies that propose distributed dynamic maintenance scheduling where the agents (sub-components) decide about their optimal maintenance individually, such as [2], [30], and [23].…”
Section: Introductionmentioning
confidence: 99%
“…Aissani et al [2] propose a multi-agent maintenance scheduling in a petroleum system using reinforcement learning (RL). RL is also used in [30] to obtain maintenance decisions for sub-components. The authors consider finite discrete values for the degradation state and solve the problem using Q-learning.…”
Section: Introductionmentioning
confidence: 99%