2022
DOI: 10.1142/s2301385023410029
|View full text |Cite
|
Sign up to set email alerts
|

UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning

Abstract: Focusing on the problem of multi-UAV cooperative air combat decision-making, a multi-UAV cooperative maneuvering decision-making approach is proposed based on multi-agent deep reinforcement learning (MARL) theory. First, the multi-UAV cooperative short-range air combat environment is established. Then, by combining the value-decomposition networks (VDNs) deep reinforcement learning theory with the embedded expert collaborative air combat experience reward function, an air combat cooperative strategy framework … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…Popular multi-agent reinforcement learning algorithms integrate value-learning and strategy-learning structures. It is worth noting that reference [14] uses the multi-agent depth strategy gradient algorithm to solve the challenges brought by the complex and uncertain dynamic environment encountered in the UAV cluster confrontation process. Reference [15] proposes a multi-agent cooperative combat simulation algorithm based on reinforcement learning to achieve a balanced decision-making method.…”
Section: Introductionmentioning
confidence: 99%
“…Popular multi-agent reinforcement learning algorithms integrate value-learning and strategy-learning structures. It is worth noting that reference [14] uses the multi-agent depth strategy gradient algorithm to solve the challenges brought by the complex and uncertain dynamic environment encountered in the UAV cluster confrontation process. Reference [15] proposes a multi-agent cooperative combat simulation algorithm based on reinforcement learning to achieve a balanced decision-making method.…”
Section: Introductionmentioning
confidence: 99%
“…where X ⊂ R p is a convex and compact feasible set. Problems of the form (1) lie at the heart of machine learning and adaptive filtering, emerging in e.g., clustering, classification, energy management, and resource allocation [8]- [11].…”
Section: Introductionmentioning
confidence: 99%