2022
DOI: 10.3390/s22186737
|View full text |Cite
|
Sign up to set email alerts
|

Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning

Abstract: In this paper, a cooperative search method for multiple UAVs is proposed to solve the problem of low efficiency of multi-UAV task execution by using a cooperative game with incomplete information. To improve search efficiency, CBBA (Consensus-Based Bundle Algorithm) is applied to designate the tasks area for each UAV. Then, Independent Deep Reinforcement Learning (IDRL) is used to solve Nash equilibrium to improve UAVs’ collaborations. The proposed reward function is smartly developed to guide UAVs to fly alon… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 35 publications
0
2
0
Order By: Relevance
“…Ground Robots Manipulators [23,24,38,46,53,56,57,68,76,83,85,91,104,106,110,110,120,123,134,135,138,139,141,142,161,167,171,172,186,205,207,[218][219][220]230,240,269] [ 22,39,49,52,54,55,59,70,71,[73][74][75][77][78][79]…”
Section: Aerial Robotsmentioning
confidence: 99%
See 1 more Smart Citation
“…Ground Robots Manipulators [23,24,38,46,53,56,57,68,76,83,85,91,104,106,110,110,120,123,134,135,138,139,141,142,161,167,171,172,186,205,207,[218][219][220]230,240,269] [ 22,39,49,52,54,55,59,70,71,[73][74][75][77][78][79]…”
Section: Aerial Robotsmentioning
confidence: 99%
“…Their algorithm makes use of two modules based on a divide-and-conquer architecture: an environmental sense module that utilizes sensing information and a policy module that is responsible for the optimal policy of the robots. Gao and Zhang [ 161 ] study a cooperative search problem while using MADRL as the solution method. The authors use independent learners on the robots to find the Nash equilibrium solution with the incomplete information available to the robots.…”
Section: Multi-robot System Applications Of Multi-agent Deep Reinforc...mentioning
confidence: 99%