2023
DOI: 10.3390/drones7050297
|View full text |Cite
|
Sign up to set email alerts
|

Task Assignment of UAV Swarms Based on Deep Reinforcement Learning

Abstract: UAV swarm applications are critical for the future, and their mission-planning and decision-making capabilities have a direct impact on their performance. However, creating a dynamic and scalable assignment algorithm that can be applied to various groups and tasks is a significant challenge. To address this issue, we propose the Extensible Multi-Agent Deep Deterministic Policy Gradient (Ex-MADDPG) algorithm, which builds on the MADDPG framework. The Ex-MADDPG algorithm improves the robustness and scalability o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 30 publications
0
4
0
Order By: Relevance
“…In the above formula, a k represents the important neighborhood agent for agent j. Then the Q-value of each agent is shown in Equation (17). Note that the Q-value here is a partially observable Q-value.…”
Section: Mean Field Modulementioning
confidence: 99%
See 1 more Smart Citation
“…In the above formula, a k represents the important neighborhood agent for agent j. Then the Q-value of each agent is shown in Equation (17). Note that the Q-value here is a partially observable Q-value.…”
Section: Mean Field Modulementioning
confidence: 99%
“…Shi et al [3] successfully migrated MARL to real-world multi-UAV handling tasks by employing Recurrent Multi-Agent Deep Deterministic Policy Gradient (R-MADDPG) and domain randomization techniques. Liu et al [17] proposed the Extensible Multi-Agent Deep Deterministic Policy Gradient (Ex-MADDPG) algorithm to address dynamic task assignment problems in UAV swarms, in which the practicality and effectiveness of the Ex-MADDPG algorithm were validated using a swarm of nine UAVs.…”
Section: Introductionmentioning
confidence: 99%
“…This approach effectively balances the conflicting objectives of safety and performance within the affine formation in obstacle‐rich environments. By employing this method, agents can navigate through complex environments while maintaining the desired formation, ensuring a harmonious interplay between safety and efficiency. (3)The efficiency and feasibility of our proposed method are validated through simulations and real‐world experiments with a fleet of self‐developed unmanned aerial vehicles called scit‐mini [32]. A video demonstrating the real‐world experiment can be viewed online at https://youtu.be/5eBN9m22Nww.…”
Section: Introductionmentioning
confidence: 99%
“…Static formations of drones are extensively used for various applications 1 , such as those involving continuous aerial observations 2 – 4 , the assignment of different tasks 5 , intruder monitoring 6 , air quality measurements 7 , 8 or the interception of rockets and ballistic missiles 9 . The formation should be maintained even in the presence of unwanted or undesired intruders that introduce a disturbance to the formation.…”
Section: Introductionmentioning
confidence: 99%