<abstract>
<p>To address the multi-flexible integrated scheduling problem with setup times, a multi-flexible integrated scheduling algorithm is put forward. First, the operation optimization allocation strategy, based on the principle of the relatively long subsequent path, is proposed to assign the operations to idle machines. Second, the parallel optimization strategy is proposed to adjust the scheduling of the planned operations and machines to make the processing as parallel as possible and reduce the no-load machines. Then, the flexible operation determination strategy is combined with the above two strategies to determine the dynamic selection of the flexible operations as the planned operations. Finally, a potential operation preemptive strategy is proposed to judge whether the planned operations will be interrupted by other operations during their processing. The results show that the proposed algorithm can effectively solve the multi-flexible integrated scheduling with setup times, and it can also better solve the flexible integrated scheduling problem.</p>
</abstract>
In this paper, a real-time scheduling problem of a dual-resource flexible job shop with robots is studied. Multiple independent robots and their supervised machine sets form their own work cells. First, a mixed integer programming model is established, which considers the scheduling problems of jobs and machines in the work cells, and of jobs between work cells, based on the process plan flexibility. Second, in order to make real-time scheduling decisions, a framework of multi-task multi-agent reinforcement learning based on centralized training and decentralized execution is proposed. Each agent interacts with the environment and completes three decision-making tasks: job sequencing, machine selection, and process planning. In the process of centralized training, the value network is used to evaluate and optimize the policy network to achieve multi-agent cooperation, and the attention mechanism is introduced into the policy network to realize information sharing among multiple tasks. In the process of decentralized execution, each agent performs multiple task decisions through local observations according to the trained policy network. Then, observation, action, and reward are designed. Rewards include global and local rewards, which are decomposed into sub-rewards corresponding to tasks. The reinforcement learning training algorithm is designed based on a double-deep Q-network. Finally, the scheduling simulation environment is derived from benchmarks, and the experimental results show the effectiveness of the proposed method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.