2023
DOI: 10.3390/pr11092677
|View full text |Cite
|
Sign up to set email alerts
|

Active Steering Controller for Driven Independently Rotating Wheelset Vehicles Based on Deep Reinforcement Learning

Zhenggang Lu,
Juyao Wei,
Zehan Wang

Abstract: This paper proposes an active steering controller for Driven Independently Rotating Wheelset (DIRW) vehicles based on deep reinforcement learning (DRL). For the two-axle railway vehicles equipped with Independently Rotating Wheelsets (IRWs), each wheel connected to a wheel-side motor, the Ape-X DDPG controller, an enhanced version of the Deep Deterministic Policy Gradient (DDPG) algorithm, is adopted. Incorporating Distributed Prioritized Experience Replay (DPER), Ape-X DDPG trains neural network function appr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…For a given experience tuple j, agent i calculates the current Q value dependent on the s j and the collective actions, while the target critic network determines the Q value for s j+1 . The TD error can be formulated using Equation (13).…”
Section: Improved Maddpg With Permentioning
confidence: 99%
See 1 more Smart Citation
“…For a given experience tuple j, agent i calculates the current Q value dependent on the s j and the collective actions, while the target critic network determines the Q value for s j+1 . The TD error can be formulated using Equation (13).…”
Section: Improved Maddpg With Permentioning
confidence: 99%
“…Recognizing the limitations of traditional control strategies, there is a growing interest in data-driven methods. Our previous research explored the application of deep reinforcement learning (DRL)-based controllers, including the deep deterministic policy gradient (DDPG) and Ape-X DDPG [13,14], leveraging deep neural networks' ability to fit nonlinear systems. Nevertheless, several limitations are encountered: (a) the existing DRL-based controllers require multiple dynamic parameters from all IRWs, and the high dimensionality of observation spaces leads to slow convergence during training; (b) current strategies mainly focus on the centralized control of the entire vehicle without achieving local control for an individual IRW, potentially affecting computational efficiency in practical applications.…”
Section: Introductionmentioning
confidence: 99%