2022
DOI: 10.1088/1742-6596/2320/1/012008
|View full text |Cite
|
Sign up to set email alerts
|

The Method for Automatic Adjustment of AGV’s PID Based on Deep Reinforcement Learning

Abstract: This paper proposes a method for automatic adjustment of PID(Proportion Integration Differentiation) based on deep reinforcement learning in order to solve the problem of smooth movement of AGV(Automated Guided Vehicle). First, based on reinforcement learning, the problem of AGV smooth operation is transformed into the solution of PID adjustment operation sequence. The action value model is constructed using the Deep Q-Learning Network(DQN) algorithm. Then, take the AGV adjustment PID as the research object. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 5 publications
0
2
0
Order By: Relevance
“…Meiling Chen presents a deep reinforcement learning method based on a self-attention mechanism, aiming at the problems of weak feature extraction ability and low training efficiency when a reinforcement learning model is used to extract high-dimensional data [7]. A deep reinforcement learning method based on an information bottleneck is proposed to solve the problem that the reinforcement learning algorithm only completes the policy update according to the reward size while ignoring the influence of the feature extraction ability of the model on the strategy learning.…”
Section: Techniques On Target Recognition and Driving Patterns In Aut...mentioning
confidence: 99%
“…Meiling Chen presents a deep reinforcement learning method based on a self-attention mechanism, aiming at the problems of weak feature extraction ability and low training efficiency when a reinforcement learning model is used to extract high-dimensional data [7]. A deep reinforcement learning method based on an information bottleneck is proposed to solve the problem that the reinforcement learning algorithm only completes the policy update according to the reward size while ignoring the influence of the feature extraction ability of the model on the strategy learning.…”
Section: Techniques On Target Recognition and Driving Patterns In Aut...mentioning
confidence: 99%
“…The robot can be trained by interacting with its environment and receiving feedback in the form of rewards or penalties based on the actions it takes. For example, the robot may receive a reward for successfully navigating to a particular location, while receiving a penalty for colliding with an obstacle [33][34][35][36][37]. Over time, the robot can use the feedback it receives to learn the optimal path to take in different environments.…”
Section: Related Workmentioning
confidence: 99%