2020
DOI: 10.1016/j.jmsy.2020.07.004
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning for combined production-maintenance and quality control of a manufacturing system with deterioration failures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
26
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 77 publications
(27 citation statements)
references
References 45 publications
1
26
0
Order By: Relevance
“…We have reported the continuous formulation of QWK (for the results, the values reported are generally those from discrete QWK, while the continuous version is used only for the state-of-the-art experimental comparison in the training process [37]) according to Eq. (11).…”
Section: Ordinal Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…We have reported the continuous formulation of QWK (for the results, the values reported are generally those from discrete QWK, while the continuous version is used only for the state-of-the-art experimental comparison in the training process [37]) according to Eq. (11).…”
Section: Ordinal Metricsmentioning
confidence: 99%
“…The application of ML and DL techniques offers great opportunities to automatize the overall QC process [11]. Indeed, these methodologies for QC have been employed in several industrial areas, but state-of-the-art is mainly oriented to present ad hoc rather than vanilla ML solutions capable of dealing with challenges of this domain, namely the intrinsic variability of the annotation procedure and the difficulty to generalize across different sets.…”
Section: Introductionmentioning
confidence: 99%
“…To minimize the makespan for an MCP scheduling problem, the authors of [25] propose a reinforcement learning (RL) algorithm to setup a change scheduling method. [26] discusses a Reinforcement Learning method to find the optimal trade-off between conflicting performance metrics for the optimization of the total expected profit of the system.…”
Section: B Deep Reinforcement Learning Applications In Decision Makin...mentioning
confidence: 99%
“…There have been some research studies that propose distributed dynamic maintenance scheduling where the agents (sub-components) decide about their optimal maintenance individually, such as [2], [30], and [23].…”
Section: Introductionmentioning
confidence: 99%
“…The authors consider finite discrete values for the degradation state and solve the problem using Q-learning. The authors of [23] propose a dynamic maintenance and production scheduling using a RL approach by considering one agent with discrete action space. These studies do not consider coupling constraints among the agents that ensure that the total demand is fulfilled.…”
Section: Introductionmentioning
confidence: 99%