A Deep Multi-Agent Reinforcement Learning Approach to Autonomous Separation Assurance

Brittain, Marc; Yang, Xuxi; Wei, Peng

doi:10.48550/arxiv.2003.08353

Cited by 3 publications

(3 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…[27], the authors use the Deep Deterministic Policy Gradient (DDPG) technique to mitigate conflicts in high density scenarios and uncertainties. Brittain et al [28] used a deep multiagent reinforcement learning framework to ensure autonomous separation between aircraft. Dalmau et al [29] used Message Passing Neural Networks (MPNN) to model air traffic control as a multiagent reinforcement learning system where agents must ensure conflict free flight through a sector.…”

Section: Related Workmentioning

confidence: 99%

Multi-UAV Conflict Resolution with Graph Convolutional Reinforcement Learning

2022

View full text Add to dashboard Cite

Safety is the primary concern when it comes to air traffic. In-flight safety between Unmanned Aircraft Vehicles (UAVs) is ensured through pairwise separation minima, utilizing conflict detection and resolution methods. Existing methods mainly deal with pairwise conflicts, however, due to an expected increase in traffic density, encounters with more than two UAVs are likely to happen. In this paper, we model multi-UAV conflict resolution as a multiagent reinforcement learning problem. We implement an algorithm based on graph neural networks where cooperative agents can communicate to jointly generate resolution maneuvers. The model is evaluated in scenarios with 3 and 4 present agents. Results show that agents are able to successfully solve the multi-UAV conflicts through a cooperative strategy.

show abstract

Section: Related Workmentioning

confidence: 99%

Multi-UAV Conflict Resolution with Graph Convolutional Reinforcement Learning

2022

View full text Add to dashboard Cite

show abstract

“…The performance of the DQN algorithm in avoiding single aircraft to multiple aircraft is investigated in [34]. A novel deep multi-agent reinforcement learning framework based on PPO is proposed in [35] to detect and avoid conflicts among multiple aircraft in a highdensity and dynamic sector under uncertainty. The DRL work mentioned above is in continuous state and discrete action space.…”

Section: Introductionmentioning

confidence: 99%

Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning

Yang

Wang

et al. 2022

IEEE Access

Self Cite

View full text Add to dashboard Cite

Obstacle avoidance for small unmanned aircraft is vital for the safety of future urban air mobility (UAM) and Unmanned Aircraft System (UAS) Traffic Management (UTM). There are a variety of techniques for real-time robust drone guidance, but numerous of them solve in discretized airspace and control, which would require an additional path smoothing step to provide flexible commands for UAS. To deliver safe and computationally efficient guidance for UAS operations, we explore the use of a deep reinforcement learning algorithm based on Proximal Policy Optimization (PPO) to lead autonomous UAS to their destinations while bypassing obstacles through continuous control. The proposed scenario state representation and reward function can map the continuous state space to continuous control for both heading angle and speed. To verify the effectiveness of the proposed learning framework, we conducted numerical experiments with static and moving obstacles. Uncertainties associated with the environments and safety operation bounds are investigated in detail. Results show that the proposed model is able to provide accurate and robust guidance and resolve conflict with a success rate of over 99%.INDEX TERMS continuous control, deep reinforcement learning, UAS obstacle avoidance, uncertainty.

show abstract

“…The per-formance of the agent in avoiding single up to multiple aircraft by using the DQN algorithm is investigated in Keong et al (2019). Brittain et al (2020) proposed a novel deep multi-agent reinforcement learning framework based on PPO to identify and resolve conflicts among a variable number of aircraft in a high-density, stochastic, and dynamic sector in en-route airspace. The DRL work mentioned above is in continuous state and discrete action space.…”

Section: Introductionmentioning

confidence: 99%