Designing an adaptive production control system using reinforcement learning

Kuhnle, Andreas; Kaiser, Jan-Philipp; Theiß, Felix; Stricker, Nicole; Lanza, Gisela

doi:10.1007/s10845-020-01612-y

Cited by 99 publications

(41 citation statements)

References 55 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…A challenge of manufacturing today is adapting to an increasingly fluctuating environment and diverse changes to meet the demands of the market. Product life cycles are getting shorter while production batch sizes are getting smaller with dynamic product variants associated with increasing complexity, which is challenging the traditional production systems (Benabdellah et al, 2019 ; Kuhnle et al, 2021 ; Ma et al, 2017 ; Prinz et al, 2019 ; Windt et al, 2008 ; Zhu et al, 2015 ). To manage these dynamics, the industrial concept of Industry 4.0 has come about and has been accepted in both research and industry, a trend linked to digitalization and smart systems that could enable factories to achieve higher production variety with reduced downtimes while improving yield, quality, safety, and decreasing cost and energy consumption (García-Magro & Soriano-Pinar, 2019 ; Järvenpää et al, 2019 ; Napoleone et al, 2020 ; Oztemel & Gursev, 2020 ; Park & Tran, 2014 ).…”

Section: Introductionmentioning

confidence: 99%

Human-centred design in industry 4.0: case study review and opportunities for future research

2021

View full text Add to dashboard Cite

The transition to industry 4.0 has impacted factories, but it also affects the entire value chain. In this sense, human-centred factors play a core role in transitioning to sustainable manufacturing processes and consumption. The awareness of human roles in Industry 4.0 is increasing, as evidenced by active work in developing methods, exploring influencing factors, and proving the effectiveness of design oriented to humans. However, numerous studies have been brought into existence but then disconnected from other studies. As a consequence, these studies in industry and research alike are not regularly adopted, and the network of studies is seemingly broad and expands without forming a coherent structure. This study is a unique attempt to bridge the gap through the literature characteristics and lessons learnt derived from a collection of case studies regarding human-centred design (HCD) in the context of Industry 4.0. This objective is achieved by a well-rounded systematic literature review whose special unit of analysis is given to the case studies, delivering contributions in three ways: (1) providing an insight into how the literature has evolved through the cross-disciplinary lens; (2) identifying what research themes associated with design methods are emerging in the field; (3) and setting the research agenda in the context of HCD in Industry 4.0, taking into account the lessons learnt, as uncovered by the in-depth review of case studies.

show abstract

Section: Introductionmentioning

confidence: 99%

Human-centred design in industry 4.0: case study review and opportunities for future research

2021

View full text Add to dashboard Cite

show abstract

“…Kuhnle et al detailed that the implementation outperformed existing benchmark heuristics. In a succeeding work from Kuhnle et al (2021), they focused on state, action, and reward designs in RL production control and concluded their importance for successful learning. Even if the semiconductor example is not directly transferable to our work, inspiration about the state and action representation, as well as the reward function, and other setup parameters can be generated, especially from their 2021 publication.…”

Section: Value-based Drl Methodsmentioning

confidence: 99%

“…For instance, if a product is loaded on an AGV, another pickup-action is not valid, just a dropdown-action is meaningful. This definition orients itself on May et al (2021) and Kuhnle et al (2021). A dropdown action is also not valid if a machine and its respective buffers are already full or have reached a fixed capacity limit of n dl being set as a parameter.…”

Section: Action Designmentioning

confidence: 99%

Modular production control using deep reinforcement learning: proximal policy optimization

2021

View full text Add to dashboard Cite

EU regulations on $$\textit{CO}_2$$ CO 2 limits and the trend of individualization are pushing the automotive industry towards greater flexibility and robustness in production. One approach to address these challenges is modular production, where workstations are decoupled by automated guided vehicles, requiring new control concepts. Modular production control aims at throughput-optimal coordination of products, workstations, and vehicles. For this np-hard problem, conventional control approaches lack in computing efficiency, do not find optimal solutions, or are not generalizable. In contrast, Deep Reinforcement Learning offers powerful and generalizable algorithms, able to deal with varying environments and high complexity. One of these algorithms is Proximal Policy Optimization, which is used in this article to address modular production control. Experiments in several modular production control settings demonstrate stable, reliable, optimal, and generalizable learning behavior. The agent successfully adapts its strategies with respect to the given problem configuration. We explain how to get to this learning behavior, especially focusing on the agent’s action, state, and reward design.

show abstract

“…Recently published work include the optimization of process control in sheet metal milling (Veeramani et al 2019), polymerization reaction systems (Ma et al 2019), laser welding (Günther et al 2016) and in deep drawing (Dornheim et al 2019). Operational optimization objects are amongst others material flow in industrial mining (Kumar et al 2020), preventive maintenance scheduling of flow line systems (Wang et al 2016) and job shop scheduling (Kuhnle et al 2020).…”

Section: Contributionmentioning

confidence: 99%

Deep reinforcement learning methods for structure-guided processing path optimization

et al. 2021

View full text Add to dashboard Cite

A major goal of materials design is to find material structures with desired properties and in a second step to find a processing path to reach one of these structures. In this paper, we propose and investigate a deep reinforcement learning approach for the optimization of processing paths. The goal is to find optimal processing paths in the material structure space that lead to target-structures, which have been identified beforehand to result in desired material properties. There exists a target set containing one or multiple different structures, bearing the desired properties. Our proposed methods can find an optimal path from a start structure to a single target structure, or optimize the processing paths to one of the equivalent target-structures in the set. In the latter case, the algorithm learns during processing to simultaneously identify the best reachable target structure and the optimal path to it. The proposed methods belong to the family of model-free deep reinforcement learning algorithms. They are guided by structure representations as features of the process state and by a reward signal, which is formulated based on a distance function in the structure space. Model-free reinforcement learning algorithms learn through trial and error while interacting with the process. Thereby, they are not restricted to information from a priori sampled processing data and are able to adapt to the specific process. The optimization itself is model-free and does not require any prior knowledge about the process itself. We instantiate and evaluate the proposed methods by optimizing paths of a generic metal forming process. We show the ability of both methods to find processing paths leading close to target structures and the ability of the extended method to identify target-structures that can be reached effectively and efficiently and to focus on these targets for sample efficient processing path optimization.

show abstract

Designing an adaptive production control system using reinforcement learning

Cited by 99 publications

References 55 publications

Human-centred design in industry 4.0: case study review and opportunities for future research

Human-centred design in industry 4.0: case study review and opportunities for future research

Modular production control using deep reinforcement learning: proximal policy optimization

Deep reinforcement learning methods for structure-guided processing path optimization

Contact Info

Product

Resources

About