Deep reinforcement learning‐based balancing and sequencing approach for mixed model assembly lines

Lv, Youlong; Yuanliang, Tan; Ray, Zhong; Zhang, Peng; Wang, Junliang; Zhang, Jie

doi:10.1049/cim2.12061

Cited by 4 publications

(3 citation statements)

References 28 publications


“…In the area of AL balancing, Li et al [6] focused on the research of balancing AL in the digital domain, and designed a DRLA with the support of deep deterministic policy gradient (DDPG) to enhance the operation and simulation effect of the assembly line digital twin model. Lv et al [7] combined the sequencing problem with the assembly line balance problem and proposed a new version of DRLA on the basis of DDPG, in which an iterative interaction mechanism between task assembly time and station load were designed to achieve production task sequencing and worker allocation layer by layer. The objective was minimizing the work overload.…”

Section: Deep Reinforcement Learning (mentioning)

confidence: 99%

“…Ct(j, k) represents the completion time of station (j, k), and Ct_max = max{Ct(j, k)} is the maximum thereof. Equation (7) indicates that any task can only be assigned to one station. Equations (8) and (9) show cycle time constraints, that is, the completion time of each station must be less than cycle time.…”

Section: Mathematical Model (mentioning)

confidence: 99%
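
The constraints described in the statement above can be illustrated with a minimal sketch. It is not taken from the citing paper: the function name `check_assignment`, the dictionary layout, and the sample data are hypothetical, and station completion time is approximated as the sum of assigned task times, which ignores the waiting time that precedence relations can cause on a two-sided line. The quoted text says completion time must be less than the cycle time; the sketch assumes "less than or equal".

```python
def check_assignment(task_times, assignment, cycle_time):
    """Check a task-to-station assignment against the quoted constraints.

    task_times: dict mapping task -> processing time (hypothetical data layout)
    assignment: dict mapping task -> station key (j, k)
    cycle_time: upper bound on each station's completion time
    """
    # A dict maps every task to at most one station, so the
    # "one station per task" rule reduces to: no task is left unassigned.
    unassigned = set(task_times) - set(assignment)

    # Assumption: Ct(j, k) is the plain sum of assigned task times
    # (no idle time from cross-side precedence is modelled here).
    ct = {}
    for task, station in assignment.items():
        ct[station] = ct.get(station, 0.0) + task_times[task]

    # Ct_max = max{Ct(j, k)} over all stations that received a task.
    ct_max = max(ct.values()) if ct else 0.0

    # Assumption: "<=" is used for the cycle time constraint.
    feasible = not unassigned and ct_max <= cycle_time
    return ct, ct_max, feasible


if __name__ == "__main__":
    times = {"a": 3, "b": 2, "c": 4}
    plan = {"a": (1, "L"), "b": (1, "L"), "c": (1, "R")}
    print(check_assignment(times, plan, cycle_time=5))
```

Keying stations by a `(j, k)` tuple mirrors the station notation in the quoted passage, with `j` read here as the position along the line and `k` as the side; that reading is an assumption.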

“…Moreover, the deep reinforcement learning algorithm has higher adaptability to the complex and ever-changing production environment, which is easy to adjust so as to alter the solution. In addition, although deep reinforcement learning algorithms have begun to be tried in solving such AL balancing problems [6,7], they are currently used to optimize simulation models and resource allocation. Therefore, this paper carries out the study on load balancing of TAL based on deep reinforcement learning for the first time.…”

Section: Introduction (mentioning)

confidence: 99%


Load Balancing of Two-Sided Assembly Line Based on Deep Reinforcement Learning

et al. 2023


In the complex and ever-changing manufacturing environment, maintaining the long-term steady and efficient work of the assembly line is the ultimate goal pursued by relevant enterprises, the foundation of which is a balanced load. Therefore, this paper carries out research on the two-sided assembly line balance problem (TALBP) for load balancing. First, a mathematical programming model is established with the objectives of optimizing the line efficiency, smoothness index, and completion time smoothness index of the two-sided assembly line (TAL). Second, a deep reinforcement learning algorithm combining distributed proximal policy optimization (DPPO) and the convolutional neural network (CNN) is proposed. Based on the distributed reinforcement learning agent structure assisted by the marker layer, the task assignment states of the two-sided assembly line and the decisions of selecting tasks are defined. Task assignment logic and a reward function are designed according to the optimization objectives to guide task selection and assignment. Finally, the performance of the proposed algorithm is verified on the benchmark problem.


