2022
DOI: 10.1109/lwc.2022.3195963
|View full text |Cite
|
Sign up to set email alerts
|

A Data-Driven Packet Routing Algorithm for an Unmanned Aerial Vehicle Swarm: A Multi-Agent Reinforcement Learning Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…The complexity is greatly reduced compared to the high complexity of centralized routing algorithms facing large-scale networks. Similarly, Qiu et al [102] continue to adapt multi-intelligent reinforcement learning to the complex and variable FANET environment by using LSTMs instead of the fully connected layers of actor-critic networks. The algorithm starts with an LSTM-based behavioral network that performs an output operation based on the current state, followed by an intelligent body that operates, receives a reward and evolves to a new state, and finally updates the parameters of the judging network using a loss function and updates the parameters of the behavioral network using a gradient ascent algorithm.…”
Section: Environment Agentmentioning
confidence: 99%
“…The complexity is greatly reduced compared to the high complexity of centralized routing algorithms facing large-scale networks. Similarly, Qiu et al [102] continue to adapt multi-intelligent reinforcement learning to the complex and variable FANET environment by using LSTMs instead of the fully connected layers of actor-critic networks. The algorithm starts with an LSTM-based behavioral network that performs an output operation based on the current state, followed by an intelligent body that operates, receives a reward and evolves to a new state, and finally updates the parameters of the judging network using a loss function and updates the parameters of the behavioral network using a gradient ascent algorithm.…”
Section: Environment Agentmentioning
confidence: 99%
“…However, the existing UAVs are inadequately intelligent for performing difficult activities, and still major of them require people's real-time control [3]. A single UAV could only execute moderately easy tasks, nonetheless, the UAV set could effectively perform several laborious and complex tasks after acceptable task planning [4]. The task distribution issue is identical to the combinatorial optimizer decision issue for many UAVs.…”
mentioning
confidence: 99%