2018
DOI: 10.1109/access.2017.2771827
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning-Based and Parametric Production-Maintenance Control Policies for a Deteriorating Manufacturing System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 64 publications
(17 citation statements)
references
References 27 publications
0
17
0
Order By: Relevance
“…Reinforcement learning explores the space of feasible solutions efficiently and effectively [7] and therefore has been actively applied to solve various optimization problems as alternatives of metaheuristic and rule-based approaches [8][9][10][11][12][13][14]. For example, Dou et al [8] proposed a path planning method for mobile robots in intelligent warehouses based on Q-learning.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning explores the space of feasible solutions efficiently and effectively [7] and therefore has been actively applied to solve various optimization problems as alternatives of metaheuristic and rule-based approaches [8][9][10][11][12][13][14]. For example, Dou et al [8] proposed a path planning method for mobile robots in intelligent warehouses based on Q-learning.…”
Section: Introductionmentioning
confidence: 99%
“…As maintenance work orders can be modeled as a sequential decisionmaking problem given machine states and system states, RL has also been used in Refs. [56][57][58][59][60] to obtain near-optimal maintenance policies for manufacturing systems. Many studies [57,58,60] integrate multi-agent-based learning and control in their methods, helping dismantle complex structural and operational dependencies among components.…”
Section: Job Dispatching and Schedulingmentioning
confidence: 99%
“…Their findings show that the combined heuristic provides better prediction results and training times than existing algorithms. Similarly, two conflicting objective functions, inventory level and the number of backorders, were combined in a reinforcement learning model to obtain optimal production/maintenance control policy [28].…”
Section: Related Literature a Inventory Models With Backordersmentioning
confidence: 99%