2021
DOI: 10.1109/access.2021.3057485
|View full text |Cite
|
Sign up to set email alerts
|

Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

Abstract: Path planning is one of the key technologies for autonomous flight of Unmanned Aerial Vehicle. Traditional path planning algorithms have some limitations and deficiencies in the complex and dynamic environment. In this paper, we propose a deep reinforcement learning approach for threedimensional path planning by utilizing the local information and relative distance without global information.UAV can obtain the limited environmental information nearby in the actual scenario with limited sensor capabilities. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 81 publications
(28 citation statements)
references
References 29 publications
0
28
0
Order By: Relevance
“…There are both black grids representing obstacles and white grids representing free walking in the grid, which makes the robot operating environment from complex to simple, and the path planning problem is relatively simple. Therefore, for the grid method, the most important thing is to determine the size of the divided grid, which will directly affect the operation of the algorithm and the final planning effect [19] (2) Geometric method uses geometric features (such as points, lines, and surfaces) to represent objects in the scene; abstracts the environmental information collected by the sensors carried by the robot into common geometric features, such as vertices, lines, curves, and corners; and then describes and records them with coordinates [20] (3) The expression of topological graph method is more abstract. It uses graphs to represent the spatial relationship between objects in the environment, and the nodes of the graph represent the feature points in the environment [21].…”
Section: Environmental Modelingmentioning
confidence: 99%
“…There are both black grids representing obstacles and white grids representing free walking in the grid, which makes the robot operating environment from complex to simple, and the path planning problem is relatively simple. Therefore, for the grid method, the most important thing is to determine the size of the divided grid, which will directly affect the operation of the algorithm and the final planning effect [19] (2) Geometric method uses geometric features (such as points, lines, and surfaces) to represent objects in the scene; abstracts the environmental information collected by the sensors carried by the robot into common geometric features, such as vertices, lines, curves, and corners; and then describes and records them with coordinates [20] (3) The expression of topological graph method is more abstract. It uses graphs to represent the spatial relationship between objects in the environment, and the nodes of the graph represent the feature points in the environment [21].…”
Section: Environmental Modelingmentioning
confidence: 99%
“…If the worst case happens, all the state–action pairs in the environment may be searched. Therefore, how to increase the search efficiency and convergence speed of the QL algorithm in path planning is a common challenge for scholars [ 18 , 19 , 20 ].…”
Section: Related Workmentioning
confidence: 99%
“…More recently, Xie et al [28] formulate UAV path planning as a POMDP. They use recurrent neurons to handle the partial observability by extracting crucial information from historical state-action pairs, and convolutional neurons to capture spatial feature information from the observation prior to determining the Q values of a state.…”
Section: B Background Of Reinforcement Learning Algorithmsmentioning
confidence: 99%