2019
DOI: 10.1109/tiv.2019.2904417
|View full text |Cite
|
Sign up to set email alerts
|

Merging in Congested Freeway Traffic Using Multipolicy Decision Making and Passive Actor-Critic Learning

Abstract: Freeway merging in congested traffic is a significant challenge toward fully automated driving. Merging vehicles need to decide not only how to merge into a spot, but also where to merge. We present a method for the freeway merging based on multi-policy decision making with a reinforcement learning method called passive actorcritic (pAC), which learns with less knowledge of the system and without active exploration. The method selects a merging spot candidate by using the state value learned with pAC. We evalu… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 47 publications
(23 citation statements)
references
References 27 publications
0
23
0
Order By: Relevance
“…Compared to existing methods, the proposed method offers several advantages, as described in Section VII. In future work, the proposed method should be extended for applications involving the control of unknown nonlinear systems, e.g., autonomous vehicles involving human interactions [1]. The realization of effective control requires the development of mathematical models of such unknown systems, regarding which only limited information is available.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Compared to existing methods, the proposed method offers several advantages, as described in Section VII. In future work, the proposed method should be extended for applications involving the control of unknown nonlinear systems, e.g., autonomous vehicles involving human interactions [1]. The realization of effective control requires the development of mathematical models of such unknown systems, regarding which only limited information is available.…”
Section: Discussionmentioning
confidence: 99%
“…Let us focus on a partially unknown nonlinear system for which the input matrix B is known but a true drift term f tr (x) is unknown. For example, such a partially unknown system can arise owing to the interaction between autonomous vehicles and manually operated vehicles [1]. A partially unknown system can be expressed as…”
Section: B Identification Of Drift Terms Using Kernel-based Functionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Several control approaches have been developed for the speed harmonisation using Q‐learning (QL) [13], model predictive control (MPC) [14], and proportional–integral feedback [16]. The MC alleviates the congestion at the merging area by assigning priority to vehicles [17–19]. Some methods have been proposed for the MC using a heuristic algorithm [18] or a reinforcement learning (RL) [19].…”
Section: Introductionmentioning
confidence: 99%
“…The MC alleviates the congestion at the merging area by assigning priority to vehicles [17–19]. Some methods have been proposed for the MC using a heuristic algorithm [18] or a reinforcement learning (RL) [19]. Distinguishing from the above studies, we consider the mixed traffic environment and combine the mechanisms of speed harmonisation and MC to improve the traffic flow and fuel consumption.…”
Section: Introductionmentioning
confidence: 99%