2022
DOI: 10.1088/1742-6596/2320/1/012002
The Determination of Reward Function in AGV Motion Control Based on DQN

Abstract: Motion control is a very important part of the AGV (Automated Guided Vehicle) field. A good motion control method makes the movement of an AGV more stable. Reinforcement learning network models are one approach to solving the AGV motion control problem. This paper introduces the Markov Decision Process and the role of the reward function. Besides, it studies and analyzes several classic reinforcement learning cases. DQN (Deep Q-Learning Network), which belongs to the family of deep reinforcement learning network mod…

Cited by 3 publications (2 citation statements) | References 12 publications
“…The robot can be trained by interacting with its environment and receiving feedback in the form of rewards or penalties based on the actions it takes. For example, the robot may receive a reward for successfully navigating to a particular location, while receiving a penalty for colliding with an obstacle [33][34][35][36][37]. Over time, the robot can use the feedback it receives to learn the optimal path to take in different environments.…”
Section: Related Work
confidence: 99%
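The reward-and-penalty scheme quoted above can be sketched as a small reward function. This is a minimal illustration, not the cited paper's actual formulation; the function name, thresholds, and reward magnitudes are all illustrative assumptions.

```python
import math

# Hypothetical AGV reward function sketching the scheme described above:
# a positive reward for reaching the goal, a penalty for collision, and a
# small shaping term for progress toward the target. All constants are
# illustrative, not taken from the cited paper.
def reward(position, goal, collided, prev_distance):
    if collided:
        return -10.0                      # penalty for hitting an obstacle
    distance = math.dist(position, goal)
    if distance < 0.1:
        return 10.0                       # reward for reaching the target
    return prev_distance - distance       # shaping: progress toward the goal
```

The shaping term keeps the reward signal dense, so the agent receives feedback on every step rather than only at the goal or on collision.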
“…All the parameters of the training network are assigned to the target network after training for a fixed number of steps. The algorithm uses an experience replay unit to reduce the correlation between training samples and to mitigate the instability of the action-value function approximated by the neural network [19]. A batch of samples is uniformly drawn from the experience library and mixed with the training samples to break the correlation between adjacent training samples and improve sample utilization during each training step.…”
Section: Introduction
confidence: 99%
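The two DQN mechanisms this statement describes, uniform experience replay and periodic target-network synchronization, can be sketched as follows. All names, the buffer capacity, and the sync interval are illustrative assumptions, not details from the cited paper.

```python
import random
from collections import deque

# Experience replay: a bounded buffer of (state, action, reward, next_state)
# transitions. Uniform sampling mixes old and new experience, which breaks
# the correlation between adjacent transitions mentioned above.
class ReplayBuffer:
    def __init__(self, capacity=10000):
        self.buffer = deque(maxlen=capacity)  # oldest entries drop out first

    def push(self, transition):
        self.buffer.append(transition)

    def sample(self, batch_size):
        return random.sample(self.buffer, batch_size)  # uniform, no replacement

# Target network update: after a fixed number of training steps, copy the
# training network's parameters into the target network (parameters are
# modeled here as plain dicts for illustration).
SYNC_EVERY = 100

def maybe_sync(step, train_params, target_params):
    if step % SYNC_EVERY == 0:
        target_params.update(train_params)  # assign all training parameters
    return target_params
```

Freezing the target network between syncs keeps the regression target stable, which is the instability mitigation the quoted passage refers to.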