2014
DOI: 10.1109/tsmc.2013.2294155
|View full text |Cite
|
Sign up to set email alerts
|

An MDP Model-Based Reinforcement Learning Approach for Production Station Ramp-Up Optimization: Q-Learning Analysis

Abstract: Ramp-up is a significant bottleneck for the introduction of new or adapted manufacturing systems. The effort and time required to ramp-up a system is largely dependent on the effectiveness of the human decision making process to select the most promising sequence of actions to improve the system to the required level of performance. Although existing work has identified significant factors influencing the effectiveness of rampup, little has been done to support the decision making during the process. This pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 49 publications
(39 citation statements)
references
References 32 publications
0
39
0
Order By: Relevance
“…A special emphasis is made here on the need for adaptive interfaces as well as the importance of utilizing notions of distributed decision support system approaches such as multi-agent systems. Doltsinis et al [20] emphasise on the need for capturing operator's actions during the ramp-up phase. Operator's hardware and software adjustments during the ramp-up phase were recorded and used as part of a reinforcement learning approach to derive generalised policies for adjustment strategies leading to faster ramp-up.…”
Section: Extraction and Integration Of Human Knowledgementioning
confidence: 99%
“…A special emphasis is made here on the need for adaptive interfaces as well as the importance of utilizing notions of distributed decision support system approaches such as multi-agent systems. Doltsinis et al [20] emphasise on the need for capturing operator's actions during the ramp-up phase. Operator's hardware and software adjustments during the ramp-up phase were recorded and used as part of a reinforcement learning approach to derive generalised policies for adjustment strategies leading to faster ramp-up.…”
Section: Extraction and Integration Of Human Knowledgementioning
confidence: 99%
“…Measuring and monitoring the performance of the production system during the ramp-up is critical, as the production process during this phase is unstable and error-prone. As a result, the system needs to be interrupted and adjusted at regular time intervals until a steady-state phase is reached (Doltsinis, Ferreira, and Lohse 2014). To identify deviations from production plans, production managers need to constantly monitor the performance of the production process during the ramp-up (Winkler, Heins, and Nyhuis 2007).…”
Section: Planning Problems During the Ramp-up Phasementioning
confidence: 99%
“…Examples include the determination of production capacities, the re-sequencing or changing of processes, the determination of lot sizes, worker training or the assignment of workers to work stations (Almgren 1999b;Scholz-Reiter et al 2007;Gopal et al 2013;Doltsinis, Ferreira, and Lohse 2014;Li et al 2014). It is clear that if companies solve these planning problems by trial and error instead of following a systematic planning approach, delays and errors may occur, which increase the time to volume and put the success of product introduction at stake (Schuh, Desoi, and Tücks 2005;Doltsinis, Ratchev, and Lohse 2013).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The success of Q-Learning has lead to many applications, such as path planning [25,31], energy management [30], routing in vehicular ad-hoc networks [42], management of water resources [27], and production planning [10].…”
Section: Q-learningmentioning
confidence: 99%