Learning to Navigate Through Reinforcement Across the Sim2Real Gap

Azzam, Rana; Chehadeh, Mohamad; Abdulhay, Oussama; Boiko, Igor; Zweiri, Yahya

doi:10.36227/techrxiv.20138960

Cited by 2 publications

(1 citation statement)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In addition, the training environment and application scenarios of UAVs usually have a serious distribution mismatch problem, resulting in agents trained in the training scenarios having poor adaptability in the application scenarios of UAV. The above problems make it difficult to use DRL for UAV autonomous navigation and obstacle avoidance [18][19][20]. Therefore, how to increase the autonomous navigation performance of UAVs from the training environment to the application environment is a work of great significance and challenge.…”

Section: Introductionmentioning

confidence: 99%

A framework for improving UAV decision of autonomous navigation from training to application migration under perceptual uncertainty

Sheng,

Liu,

et al. 2024

Meas. Sci. Technol.

View full text Add to dashboard Cite

Unmanned Aerial Vehicles (UAVs) autonomous navigation based on reinforcement learning (RL) usually requires training agents in simulation scenarios and then transferring the trained agents to application scenarios. However, due to serious distribution mismatch between the idealized simulation scenario and the application environment and the inevitable uncertainty perception problem of airborne sensors in complex scenarios, the navigation performance of UAV under migration applications is not ideal. This work fully analyzes the factors that affect UAV navigation performance, including algorithm performance, training strategy, and state awareness. Based on the analysis results, this article proposes a framework to improve the autonomous navigation performance of UAVs in the migration process from training to application, which consists of three parts: "scenario-perception-algorithm". In addition, this paper proposes improvement strategies for each part from the perspectives of spatial features, temporal features, and perceptual denoising. We combine the proposed framework with navigation algorithms to improve the navigation decision-making performance of UAVs in migration applications under uncertainty perception. Many simulation experiments demonstrate the effectiveness of the proposed framework and its robustness to uncertainty perception.

show abstract

Section: Introductionmentioning

confidence: 99%

A framework for improving UAV decision of autonomous navigation from training to application migration under perceptual uncertainty

Sheng,

Liu,

et al. 2024

Meas. Sci. Technol.

View full text Add to dashboard Cite

show abstract

Swarm Cooperative Navigation Using Centralized Training and Decentralized Execution

Azzam

Boiko

Zweiri

2023

Drones

View full text Add to dashboard Cite

The demand for autonomous UAV swarm operations has been on the rise following the success of UAVs in various challenging tasks. Yet conventional swarm control approaches are inadequate for coping with swarm scalability, computational requirements, and real-time performance. In this paper, we demonstrate the capability of emerging multi-agent reinforcement learning (MARL) approaches to successfully and efficiently make sequential decisions during UAV swarm collaborative tasks. We propose a scalable, real-time, MARL approach for UAV collaborative navigation where members of the swarm have to arrive at target locations at the same time. Centralized training and decentralized execution (CTDE) are used to achieve this, where a combination of negative and positive reinforcement is employed in the reward function. Curriculum learning is used to facilitate the sought performance, especially due to the high complexity of the problem which requires extensive exploration. A UAV model that highly resembles the respective physical platform is used for training the proposed framework to make training and testing realistic. The scalability of the platform to various swarm sizes, speeds, goal positions, environment dimensions, and UAV masses has been showcased in (1) a load drop-off scenario, and (2) UAV swarm formation without requiring any re-training or fine-tuning of the agents. The obtained simulation results have proven the effectiveness and generalizability of our proposed MARL framework for cooperative UAV navigation.

show abstract

Learning to Navigate Through Reinforcement Across the Sim2Real Gap

Cited by 2 publications

References 22 publications

A framework for improving UAV decision of autonomous navigation from training to application migration under perceptual uncertainty

A framework for improving UAV decision of autonomous navigation from training to application migration under perceptual uncertainty

Swarm Cooperative Navigation Using Centralized Training and Decentralized Execution

Contact Info

Product

Resources

About