To solve the dynamic and real-time problem of multirobot task allocation in intelligent warehouse system under parts-to-picker mode, this paper presents a combined solution based on adaptive task pool strategy and Covariance Matrix Adaptation Evolutionary Strategy (CMA-ES) algorithm. In the first stage of the solution, a variable task pool is used to store dynamically added tasks, which can dynamically divide continuous and large-scale task allocation problems into small-scale subproblems to solve them to meet dynamic requirements. And an adaptive control strategy is used to automatically adjust the total number of tasks in the task pool to achieve a trade-off among throughput, energy consumption, and waiting time, which has better adaptability than manually adjusting the size of the task pool. In the second stage of the solution, when the task pool is full, tasks in the task pool will be assigned to robots. For the task allocation problem, this paper regards it as an optimization problem and uses the CMA-ES algorithm to find the optimal task assignment solution for all the robots. By comparing with fixed threshold method under 56 different task pool sizes, the experimental results show that the throughput can be close to reaching the optimal level, and the average distance traveled by robots to handle each unit is lower using adaptive threshold method; so, adaptive task pool solution has better adaptability and can find the optimal task pool size by itself. This method can satisfy the dynamic and real-time requirements and can be effectively applied to the intelligent warehouse system.
Aiming at the target encirclement problem of multi‐robot systems, a target hunting control method based on reinforcement learning is proposed. First, the Markov game modeling for the multi‐robot system is carried out. According to the task of hunting, potential energy models are designed to meet the requirements of arriving at the desired state and avoiding obstacles. The multi‐robot reinforcement learning algorithm guided by the potential energy models is presented to perform the hunting, where reinforcement learning principles are combined with the model control. Secondly, based on the potential energy models, the target‐tracking hunting strategy and the target‐circumnavigation hunting strategy are established. In the former, the consensus tracking of multi‐robot systems is achieved by designing the velocity potential energy function. And in the latter, virtual circumnavigation points are added to construct the potential energy function, which realizes the desired circumnavigation. Finally, the effectiveness of target hunting control based on the multi‐robot reinforcement learning method is verified by simulation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.