Richard Nai scite author profile

Bayerlein et al. 2020

Coverage path planning (CPP) is a critical problem in robotics, where the goal is to find an efficient path that covers every point in an area of interest. This work addresses the power-constrained CPP problem with recharge for battery-limited unmanned aerial vehicles (UAVs). In this problem, a notable challenge emerges from integrating recharge journeys into the overall coverage strategy, highlighting the intricate task of making strategic, long-term decisions. We propose a novel proximal policy optimization (PPO)-based deep reinforcement learning (DRL) approach with map-based observations, utilizing action masking and discount factor scheduling to optimize coverage trajectories over the entire mission horizon. We further provide the agent with a position history to handle emergent state loops caused by the recharge capability. Our approach outperforms a baseline heuristic and generalizes to different target zones and maps, though generalization to unseen maps is limited. We offer valuable insights into DRL algorithm design for long-horizon problems and provide a publicly available software framework for the CPP problem.

UAV Path Planning using Global and Local Map Information with Deep Reinforcement Learning

Bayerlein et al. 2021

In reinforcement learning (RL), exploiting environmental symmetries can significantly enhance efficiency, robustness, and performance. However, ensuring that the deep RL policy and value networks are respectively equivariant and invariant to exploit these symmetries is a substantial challenge. Related works try to design networks that are equivariant and invariant by construction, limiting them to a very restricted library of components, which in turn hampers the expressiveness of the networks. This paper proposes a method to construct equivariant policies and invariant value functions without specialized neural network components, which we term equivariant ensembles. We further introduce a regularization term that adds inductive bias during training. In a map-based path planning case study, we show how equivariant ensembles and regularization benefit sample efficiency and performance.

uavAP: A Modular Autopilot Framework for UAVs

Dantsker et al. 2020

uavEE: A Modular, Power-Aware Emulation Environment for Rapid Prototyping and Testing of UAVs

Dantsker et al. 2018

Evaluation of Communication Scenarios Inside the Electrical Power System

Codetta‐Raiteri, International Journal of Modelling and Simulation, 2010