2021
DOI: 10.1109/tccn.2021.3063170
|View full text |Cite
|
Sign up to set email alerts
|

Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control

Abstract: Over the past few years, the use of swarms of Unmanned Aerial Vehicles (UAVs) in monitoring and remote area surveillance applications has become widespread thanks to the price reduction and the increased capabilities of drones. The drones in the swarm need to cooperatively explore an unknown area, in order to identify and monitor interesting targets, while minimizing their movements. In this work, we propose a distributed Reinforcement Learning (RL) approach that scales to larger swarms without modifications. … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(13 citation statements)
references
References 27 publications
0
13
0
Order By: Relevance
“…The infrastructural damages are usually assessed using the fragility curves predictions that are built on statistical and historical data. However, now, the utilization of Random Forest, One-shot learning, neural networks, and Support Vector Machine (SVM)-based AI methods can be used effectively in the detection of damages during a flood event [103]. Particularly, for the aged care facilities, the application of AI techniques will not only assist in damage detection but will also help in obtaining the forecasts of upcoming flood events and the provision of early warning for the safe evacuation of the aged care facilities.…”
Section: Flood Risk Management and Evacuation Strategies For The Aged Care Facilitiesmentioning
confidence: 99%
“…The infrastructural damages are usually assessed using the fragility curves predictions that are built on statistical and historical data. However, now, the utilization of Random Forest, One-shot learning, neural networks, and Support Vector Machine (SVM)-based AI methods can be used effectively in the detection of damages during a flood event [103]. Particularly, for the aged care facilities, the application of AI techniques will not only assist in damage detection but will also help in obtaining the forecasts of upcoming flood events and the provision of early warning for the safe evacuation of the aged care facilities.…”
Section: Flood Risk Management and Evacuation Strategies For The Aged Care Facilitiesmentioning
confidence: 99%
“…Figure 1 plots the accumulated reward J(ω t ) v.s. communication and sample complexity 3 . Each curve includes 10 repeated experiments, and its upper and lower envelopes denote the 95% and 5% percentiles of the 10 repetitions, respectively.…”
Section: Methodsmentioning
confidence: 99%
“…Multi-agent reinforcement learning (MARL) has achieved great success in various application domains, including control [1][2][3], robotics [4], wireless sensor networks [5,6], intelligent systems [7], etc. In MARL, a set of fully decentralized agents interact with a dynamic environment following their own policies and collect local rewards, and their goal is to collaboratively learn the optimal joint policy that achieves the maximum expected accumulated reward.…”
Section: Introductionmentioning
confidence: 99%
“…In [24], a distributed reinforcement learning (RL) approach is proposed with an algorithmic framework that relies on the possibility of drones exchanging some information through communication channels to achieve context awareness and implicitly coordinate the actions of UAV swarms. In [25], an end-to-end collaborative multiagent reinforcement learning (MARL) scheme is presented that enables UAVs to make intelligent flight decisions for collaborative target tracking based on the past and current state of the target.…”
Section: Related Workmentioning
confidence: 99%