Distributed reinforcement learning for flexible UAV swarm control with transfer learning capabilities

Venturini, Federico; Mason, Federico; Pase, Francesco; Chiariotti, Federico; Testolin, Alberto; Zanella, Andréa; Zorzi, Michele

doi:10.1145/3396864.3399701

Cited by 17 publications

(12 citation statements)

References 22 publications


“…Other MARL path planning approaches to minimize the age of information of collected data include [12] and [13]. In [24], a swarm of UAVs on a target detection and tracking mission in an unknown environment is controlled through a distributed DQN approach. While the authors also use convolutional processing to feed map information to the agents, the map is initially unknown and has to be explored to detect the targets.…”

Section: A Related Work (mentioning)

confidence: 99%

Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning

Bayerlein, Theile, Caccamo, et al. 2021

IEEE Open J. Commun. Soc.

108


Harvesting data from distributed Internet of Things (IoT) devices with multiple autonomous unmanned aerial vehicles (UAVs) is a challenging problem requiring flexible path planning methods. We propose a multi-agent reinforcement learning (MARL) approach that, in contrast to previous work, can adapt to profound changes in the scenario parameters defining the data harvesting mission, such as the number of deployed UAVs, number, position and data amount of IoT devices, or the maximum flying time, without the need to perform expensive recomputations or relearn control policies. We formulate the path planning problem for a cooperative, non-communicating, and homogeneous team of UAVs tasked with maximizing collected data from distributed IoT sensor nodes subject to flying time and collision avoidance constraints. The path planning problem is translated into a decentralized partially observable Markov decision process (Dec-POMDP), which we solve through a deep reinforcement learning (DRL) approach, approximating the optimal UAV control policy without prior knowledge of the challenging wireless channel characteristics in dense urban environments. By exploiting a combination of centered global and local map representations of the environment that are fed into convolutional layers of the agents, we show that our proposed network architecture enables the agents to cooperate effectively by carefully dividing the data collection task among themselves, adapt to large complex environments and state spaces, and make movement decisions that balance data collection goals, flight-time efficiency, and navigation constraints. 
Finally, learning a control policy that generalizes over the scenario parameter space enables us to analyze the influence of individual parameters on collection performance and provide some intuition about system-level benefits.

Index Terms: Internet of Things (IoT), map-based planning, multi-agent reinforcement learning (MARL), trajectory planning, unmanned aerial vehicle (UAV).



“…The UAV route can be optimised to provide maximum area coverage of the area in minimum time and cost [83] (Figure 6). The UAV swarm will collect the images of the region and provide the gathered data to the control centre [85][86][87][88]. Recently, Albani et al [85] applied a macroscopic model for monitoring an area using UAVs.…”

Section: RQ-4 How Can the Authorities Improve the Existing Flood Management Operation With Cutting-edge Technologies? (mentioning)

confidence: 99%

“…The UAV swarm will collect the images of the region and provide the gathered data to the control centre [85][86][87][88]. Recently, Albani et al [85] applied a macroscopic model for monitoring an area using UAVs. Parametrisation was proposed for efficient allocation of the UAVs; abstract multiple-agent simulations were conducted to deploy UAVs in multiple areas, and simulation of UAV swarm was carried out for mapping the areas.…”

Section: RQ-4 How Can the Authorities Improve the Existing Flood Management Operation With Cutting-edge Technologies? (mentioning)

confidence: 99%

An Integrated Approach for Post-Disaster Flood Management Via the Use of Cutting-Edge Technologies and UAVs: A Review

et al. 2021


Rapid advances that improve flood management have facilitated the disaster response by providing first aid services, finding safe routes, maintaining communication and developing flood maps. Different technologies such as image processing, satellite imagery, synthetic imagery and integrated approaches have been extensively analysed in the literature for disaster operations. There is a need to review cutting-edge technologies for flood management. This paper presents a review of the latest advancements in the flood management domain based on image processing, artificial intelligence and integrated approaches with a focus on post-disaster. It answers the following research questions: (1) What are the latest developments in image processing for flood management in a post-disaster scenario? (2) What are the latest techniques for flood management based on artificial intelligence in a post-disaster scenario? (3) What are the existing gaps in the selected technologies for post-disaster? (4) How can the authorities improve the existing post-disaster management operation with cutting-edge technologies? A novel framework has been proposed to optimise flood management with the application of a holistic approach.


“…In the work of Venturini et al [115], the authors considered a general MARL framework for the initial exploration and surveillance of a swarm of independent UAVs. Their scheme followed the framework in which observations of other agents are used to make decisions and to avoid collision, thereby encouraging cooperation.…”

Section: Flocking Strategies and UAV Coordination (mentioning)

confidence: 99%

“…Publication | Challenge | Optimization Criteria | ML Method
Olfati-Saber [81] | Flocking challenges | connectivity, energy | Distributed flocking algorithms
Maza et al [76], [77] | Architecture | execution cost | Temporal planning, contract net
Quintero [92] | Localization | distance and heading | DP
Xu et al [122] | Ensuring flocking rules | cohesion, separation, and alignment | NDP
Hung and Givigi [49] | Coordination | flocking cost function | Q-learning
Tsai [113] | Vision-based collision avoidance | HTER | RNN
Jafrai et al [52] | Flocking design | multi-objective properties | Bio-inspired RL
Venturini et al [115] | Exploration and surveillance | target reaching efficiency | Deep Q-learning
Anicho et al [6] | Coordinating | coverage | RL and SI
Sharma and Ghose [104] | Collision avoidance | swarm size and stability | Swarm laws
Decentralized alg. ABSs with a limited coverage range.…”

Section: Publication (mentioning)

confidence: 99%

Machine Learning Methods for UAV Flocks Management-A Survey

Azoulay, Haddad, Reches 2021

IEEE Access


The development of unmanned aerial vehicles (UAVs) has been gaining momentum in recent years owing to technological advances and a significant reduction in their cost. UAV technology can be used in a wide range of domains, including communication, agriculture, security, and transportation. It may be useful to group the UAVs into clusters/flocks in certain domains, and various challenges associated with UAV usage can be alleviated by clustering. Several computational challenges arise in UAV flock management, which can be solved by using machine learning (ML) methods. In this survey, we describe the basic terms relating to UAVs and modern ML methods, and we provide an overview of related tutorials and surveys. We subsequently consider the different challenges that appear in UAV flocks. For each issue, we survey several machine learning-based methods that have been suggested in the literature to handle the associated challenges. Thereafter, we describe various open issues in which ML can be applied to solve the different challenges of flocks, and we suggest means of using ML methods for this purpose. This comprehensive review may be useful for both researchers and developers in providing a wide view of various aspects of state-of-the-art ML technologies that are applicable to flock management.
