2020 Chinese Automation Congress (CAC) 2020
DOI: 10.1109/cac51589.2020.9327752
|View full text |Cite
|
Sign up to set email alerts
|

UAV online path planning technology based on deep reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 8 publications
0
4
0
Order By: Relevance
“…On the other hand, for different task backgrounds and environments, suitable reward functions can be designed as incentive signals for drone decision-making, helping drones to complete reconnaissance decision-making tasks autonomously through training. Zhao Yu et al used deep reinforcement learning algorithms to construct a control model and a coordination mechanism between drones to control the collaborative flight of multiple fixed wing drones, and verified the effectiveness of collision avoidance during collaborative flight of multiple fixed wing drones; Fan Longtao et al proposed a reinforcement learning method based on attention mechanism [32,33]. Firstly, a task allocation solution model was constructed through attention mechanism, and then the model was continuously optimized using reinforcement algorithm to obtain an approximate optimal solution.…”
Section: Collaborative Trajectory Planning Technologymentioning
confidence: 99%
“…On the other hand, for different task backgrounds and environments, suitable reward functions can be designed as incentive signals for drone decision-making, helping drones to complete reconnaissance decision-making tasks autonomously through training. Zhao Yu et al used deep reinforcement learning algorithms to construct a control model and a coordination mechanism between drones to control the collaborative flight of multiple fixed wing drones, and verified the effectiveness of collision avoidance during collaborative flight of multiple fixed wing drones; Fan Longtao et al proposed a reinforcement learning method based on attention mechanism [32,33]. Firstly, a task allocation solution model was constructed through attention mechanism, and then the model was continuously optimized using reinforcement algorithm to obtain an approximate optimal solution.…”
Section: Collaborative Trajectory Planning Technologymentioning
confidence: 99%
“…As shown in Figure 3, the combination of different coefficients can determine the shape and direction of the path. In previous researches [14][15][16], receding horizon control (RHC) strategy was mostly used to optimize these coefficients online. However, the serial solution mechanism of RHC cannot well meet the real-time requirements in complex radio environments.…”
Section: Problem Formulationmentioning
confidence: 99%
“…The relative positions between UAV, the destination and obstacles are extracted from the sample data as the input of the neural network, and the reaction coefficient of the IFDS model is used as the output of the neural network. The authors in [15,16] adopt the deep reinforcement learning (DRL) algorithm to optimize the reaction coefficients, which retains the advantages of the analytical method and maintains a high calculation speed. The algorithm has great application potential.…”
Section: Introductionmentioning
confidence: 99%
“…Several online path planners that are commonly used include Neural Network (NN), Stochastic models, Probabilistic models and Graph. NN-based approaches are choosen due to their hardware advancement and big data availability such as an improved Q-learning based method for reactive obstacle avoidance [13], a deep reinforcement learning approach in assisted edge computing networks [14], and a deep reinforcement learning technique to calculate online path planning [15]. Those papers claimed that generated online path is smooth, fast and fuel efficient.…”
Section: Introductionmentioning
confidence: 99%