2021
DOI: 10.48550/arxiv.2105.05701
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep Multi-agent Reinforcement Learning for Highway On-Ramp Merging in Mixed Traffic

Abstract: On-ramp merging is a challenging task for autonomous vehicles (AVs), especially in mixed traffic where AVs coexist with human-driven vehicles (HDVs). In this paper, we formulate the mixed-traffic highway on-ramp merging problem as a multi-agent reinforcement learning (MARL) problem, where the AVs (on both merge lane and through lane) collaboratively learn a policy to adapt to HDVs to maximize the traffic throughput. We develop an efficient and scalable MARL framework that can be used in dynamic traffic where t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
33
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 18 publications
(33 citation statements)
references
References 41 publications
0
33
0
Order By: Relevance
“…On the other hand, multi-agent reinforcement learning (MARL) algorithms have also been explored for autonomous driving tasks [13,14,22,23]. An MARL algorithm with hard-coded safety constraints [13] was proposed to solve the double-merge problem.…”
Section: Data-driven Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…On the other hand, multi-agent reinforcement learning (MARL) algorithms have also been explored for autonomous driving tasks [13,14,22,23]. An MARL algorithm with hard-coded safety constraints [13] was proposed to solve the double-merge problem.…”
Section: Data-driven Methodsmentioning
confidence: 99%
“…In such a framework, a hierarchical temporal abstraction method was applied to reduce the effective horizon and the variance of the gradient estimation error. In the recent literature [23], an MARL algorithm was delivered to solve the onramp merging problem with safety enhancement by a novel priority-based safety supervisor. In addition, a novel MARL approach [22] was realized with the combination of Graphic Convolution Neural Network (GCN) [24] and Deep Q Network (DQN) [25] to better fuse the acquired information from collaborative sensing, showing promising results on a 3-lane freeway containing 2 off-ramps highway environment.…”
Section: Data-driven Methodsmentioning
confidence: 99%
See 3 more Smart Citations