Proceedings of the 6th ACM Workshop on Micro Aerial Vehicle Networks, Systems, and Applications 2020
DOI: 10.1145/3396864.3399701
|View full text |Cite
|
Sign up to set email alerts
|

Distributed reinforcement learning for flexible UAV swarm control with transfer learning capabilities

Abstract: Over the past few years, the use of swarms of Unmanned Aerial Vehicles (UAVs) in monitoring and remote area surveillance applications has become widespread thanks to the price reduction and the increased capabilities of drones. The drones in the swarm need to cooperatively explore an unknown area, in order to identify and monitor interesting targets, while minimizing their movements. In this work, we propose a distributed Reinforcement Learning (RL) approach that scales to larger swarms without modifications. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(12 citation statements)
references
References 22 publications
0
11
0
Order By: Relevance
“…Other MARL path planning approaches to minimize the age of information of collected data include [12] and [13]. In [24], a swarm of UAVs on a target detection and tracking mission in an unknown environment is controlled through a distributed DQN approach. While the authors also use convolutional processing to feed map information to the agents, the map is initially unknown and has to be explored to detect the targets.…”
Section: A Related Workmentioning
confidence: 99%
“…Other MARL path planning approaches to minimize the age of information of collected data include [12] and [13]. In [24], a swarm of UAVs on a target detection and tracking mission in an unknown environment is controlled through a distributed DQN approach. While the authors also use convolutional processing to feed map information to the agents, the map is initially unknown and has to be explored to detect the targets.…”
Section: A Related Workmentioning
confidence: 99%
“…The UAV route can be optimised to provide maximum area coverage of the area in minimum time and cost [83] (Figure 6). The UAV swarm will collect the images of the region and provide the gathered data to the control centre [85][86][87][88]. Recently, Albani et al [85] applied a macroscopic model for monitoring an area using UAVs.…”
Section: Rq-4 How Can the Authorities Improve The Existing Flood Management Operation With Cutting-edge Technologies?mentioning
confidence: 99%
“…The UAV swarm will collect the images of the region and provide the gathered data to the control centre [85][86][87][88]. Recently, Albani et al [85] applied a macroscopic model for monitoring an area using UAVs. Parametriasition was proposed for efficient allocation of the UAVs; abstract multiple-agent simulations were conducted to deploy UAVs in multiple areas, and simulation of UAV swarm was carried out for mapping the areas.…”
Section: Rq-4 How Can the Authorities Improve The Existing Flood Management Operation With Cutting-edge Technologies?mentioning
confidence: 99%
“…In the work of Venturini et al [115], the authors considered a general MARL framework for the initial exploration and surveillance of a swarm of independent UAVs. Their scheme followed the framework in which observations of other agents are used to make decisions and to avoid collision, thereby encouraging cooperation.…”
Section: Flocking Strategies and Uav Coordinationmentioning
confidence: 99%
“…Challenge Optimization ML Criteria Method Olfati-Saber [81] Flocking challenges connectivity, energy Distributed flocking algorithms Maza et al [76], [77] Architecture execution cost Temporal planning, contract net Quintero [92] Localization distance and heading DP Xu et al [122] Ensuring flocking rules cohesion, separation, NDP and alignment Hung and Givigi [49] Coordination flocking cost function Q-learning Tsai [113] Vision-based collision HTER RNN avoidance Jafrai et al [52] Flocking design multi-objective properties Bio-inspired RL Venturini et al [115] Exploration and target reaching efficiency Deep Q-learning surveillance Anicho et al [6] Coordinating coverage RL and SI Sharma and Ghose [104] Collision avoidance swarm size and stability Swarm laws Decentralized alg. ABSs with a limited coverage range.…”
Section: Publicationmentioning
confidence: 99%