2023
DOI: 10.3390/s23020990
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Study of Cooperative Platoon Merging Control Based on Reinforcement Learning

Abstract: The time that a vehicle merges in a lane reduction can significantly affect passengers’ safety, comfort, and energy consumption, which can, in turn, affect the global adoption of autonomous electric vehicles. In this regard, this paper analyzes how connected and automated vehicles should cooperatively drive to reduce energy consumption and improve traffic flow. Specifically, a model-free deep reinforcement learning approach is used to find the optimal driving behavior in the scenario in which two platoons are … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 58 publications
0
2
0
Order By: Relevance
“…However, the literature applies the DDPG algorithm to solve the platoon cooperative control problem with a single agent, and the reward value curves of training fluctuate a lot, and the effect is more general. [47] and [48] point out that the PPO algorithm can be used to achieve cooperative control of vehicles under the platoon merging operation in the mixed traffic environment of urban road intersections and a set of PPO hyperparameters is proposed to explore the effect of the automated extraction feature on policy prediction. However, it is unclear how the PPO algorithm controls vehicle speed, acceleration, and other motion states.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the literature applies the DDPG algorithm to solve the platoon cooperative control problem with a single agent, and the reward value curves of training fluctuate a lot, and the effect is more general. [47] and [48] point out that the PPO algorithm can be used to achieve cooperative control of vehicles under the platoon merging operation in the mixed traffic environment of urban road intersections and a set of PPO hyperparameters is proposed to explore the effect of the automated extraction feature on policy prediction. However, it is unclear how the PPO algorithm controls vehicle speed, acceleration, and other motion states.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The first element of the control sequence is entered into the control system as an actual control quantity as follows: (48) After the end of the current control cycle, it proceeds to the next one, where the loop iterates and achieves the tracking of the desired lane-changing trajectory.…”
Section: Cb C a B C B C A B C A B C A B C A B C Amentioning
confidence: 99%