Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

Xie, Ronglei; Zhang, Meng; Wang, Lifeng; Li, Haochen; Wang, Kaipeng; Wu, Zhe

doi:10.1109/access.2021.3057485

Cited by 81 publications

(28 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…There are both black grids representing obstacles and white grids representing free walking in the grid, which makes the robot operating environment from complex to simple, and the path planning problem is relatively simple. Therefore, for the grid method, the most important thing is to determine the size of the divided grid, which will directly affect the operation of the algorithm and the final planning effect [19] (2) Geometric method uses geometric features (such as points, lines, and surfaces) to represent objects in the scene; abstracts the environmental information collected by the sensors carried by the robot into common geometric features, such as vertices, lines, curves, and corners; and then describes and records them with coordinates [20] (3) The expression of topological graph method is more abstract. It uses graphs to represent the spatial relationship between objects in the environment, and the nodes of the graph represent the feature points in the environment [21].…”

Section: Environmental Modelingmentioning

confidence: 99%

Path Planning of Storage and Logistics Mobile Robot Based on ACA-E Algorithm

Zhao

2022

Journal of Sensors

View full text Add to dashboard Cite

In this paper, warehouse logistics robot as the research object, starting from the reality of modern e-commerce logistics industry, proposed a warehousing logistics mobile robot path planning method. Ant colony algorithm is used to plan the forward path of mobile robot in static and dynamic environment of warehouse logistics. The results show that the elite-based strategy improves the global search capability of ACA and eliminates redundant nodes in the path, and the path length of the obtained plan is significantly better than that of traditional ACA, which is reduced by 3.46% and 5.90%, respectively, indicating that the elite-based strategy and the central-point-based smoothing method play their roles. The path length of the robot’s final operation is larger than that of the global path planning, which increases by 1, accounting for 6.67% of the original path length. Therefore, the storage and logistics mobile robot based on ACA-E algorithm has short driving distance and superior obstacle avoidance ability.

show abstract

Section: Environmental Modelingmentioning

confidence: 99%

Path Planning of Storage and Logistics Mobile Robot Based on ACA-E Algorithm

Zhao

2022

Journal of Sensors

View full text Add to dashboard Cite

show abstract

“…If the worst case happens, all the state–action pairs in the environment may be searched. Therefore, how to increase the search efficiency and convergence speed of the QL algorithm in path planning is a common challenge for scholars [ 18 , 19 , 20 ].…”

Section: Related Workmentioning

confidence: 99%

CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning

Lyu

Yang

et al. 2022

Sensors

View full text Add to dashboard Cite

How to generate the path planning of mobile robots quickly is a problem in the field of robotics. The Q-learning(QL) algorithm has recently become increasingly used in the field of mobile robot path planning. However, its selection policy is blind in most cases in the early search process, which slows down the convergence of optimal solutions, especially in a complex environment. Therefore, in this paper, we propose a continuous local search Q-Learning (CLSQL) algorithm to solve these problems and ensure the quality of the planned path. First, the global environment is gradually divided into independent local environments. Then, the intermediate points are searched in each local environment with prior knowledge. After that, the search between each intermediate point is realized to reach the destination point. At last, by comparing other RL-based algorithms, the proposed method improves the convergence speed and computation time while ensuring the optimal path.

show abstract

“…More recently, Xie et al [28] formulate UAV path planning as a POMDP. They use recurrent neurons to handle the partial observability by extracting crucial information from historical state-action pairs, and convolutional neurons to capture spatial feature information from the observation prior to determining the Q values of a state.…”

Section: B Background Of Reinforcement Learning Algorithmsmentioning

confidence: 99%

Adaptive UAV Swarm Mission Planning by Temporal Difference Learning

Gopalakrishnan

Al‐Rubaye

Tsourdos

2021

2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC)

View full text Add to dashboard Cite

The prevalence of Unmanned Aerial Vehicles in precision agriculture has been growing rapidly. This paper tackles the UAV global mission planning problem by first incorporating a greater capacity for human-machine teaming in the design of a flexibly autonomous, near-fully-distributed Mission Management System for UAV swarms. Subsequently, to maximize the efficiency with which missions are carried out, the two problems of global mission planning: task assignment/routing and path planning, were solved together, for small problem sizes, by an integrated solution. This consists of a geometric clustering algorithm which prioritizes the minimization of overall mission time, and an off-policy, modelfree Temporal Difference Learning global agent capable of learning about an initially unknown mission environment through simulations. The latter component makes the solution adaptive to missions with different requirements.

show abstract

Unmanned Aerial Vehicle Path Planning Algorithm Based on Deep Reinforcement Learning in Large-Scale and Dynamic Environments

Cited by 81 publications

References 29 publications

Path Planning of Storage and Logistics Mobile Robot Based on ACA-E Algorithm

Path Planning of Storage and Logistics Mobile Robot Based on ACA-E Algorithm

CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning

Adaptive UAV Swarm Mission Planning by Temporal Difference Learning

Contact Info

Product

Resources

About