2021
DOI: 10.1049/itr2.12107
|View full text |Cite
|
Sign up to set email alerts
|

Continuous decision‐making for autonomous driving at intersections using deep deterministic policy gradient

Abstract: Intersections have been identified as the most complex and accident-prone traffic scenarios on road. Making appropriate decisions at intersections for driving safety, efficiency, and comfort become a challenging task for autonomous vehicles (AVs). The existing research on AV decision-making at intersections either considers a single scenario only with discrete behaviour outputs or ignores the requirements for driving efficiency and comfort. To address these problems, this study proposed a deep reinforcement le… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 25 publications
(8 citation statements)
references
References 54 publications
0
8
0
Order By: Relevance
“…In contrast, the learning-based method is more flexible in execution. The dominant method in learning-based AIM is multi-agent reinforcement learning (MARL), including Deep Q Learning (DQN) [39][40][41][42][43], Dueling Deep Q Learning (DDQN) [44], Deep Deterministic Policy Gradient (DDPG) [45], Proximal Policy Optimization (PPO) [46][47][48], Twin Delayed Deep Deterministic Policy Gradient (TD3) [49][50][51], and Soft Actor-Critic (SAC) [52]. Besides, DCL-AIM introduced coordinate state and independent state for CAVs to react in different scenarios [53], RAIM [54] and adv.RAIM [55] applied encoder-decoder structure with LSTM cell, AIM5LA further considered communication delay based on the adv.RAIM [56], and game theory was utilized to determine the leader-follower to enhance the performance of reinforcement learning in [52,57].…”
Section: Literature Reviewmentioning
confidence: 99%
“…In contrast, the learning-based method is more flexible in execution. The dominant method in learning-based AIM is multi-agent reinforcement learning (MARL), including Deep Q Learning (DQN) [39][40][41][42][43], Dueling Deep Q Learning (DDQN) [44], Deep Deterministic Policy Gradient (DDPG) [45], Proximal Policy Optimization (PPO) [46][47][48], Twin Delayed Deep Deterministic Policy Gradient (TD3) [49][50][51], and Soft Actor-Critic (SAC) [52]. Besides, DCL-AIM introduced coordinate state and independent state for CAVs to react in different scenarios [53], RAIM [54] and adv.RAIM [55] applied encoder-decoder structure with LSTM cell, AIM5LA further considered communication delay based on the adv.RAIM [56], and game theory was utilized to determine the leader-follower to enhance the performance of reinforcement learning in [52,57].…”
Section: Literature Reviewmentioning
confidence: 99%
“…Similarly, continuous decision-making for intersection cases in top three accidentprone crossing paths in a Carla simulator using DDPG and CNN surpassed the limitation of single scenario with discrete behavior outputs fulfilling the criteria for safe AVS [140]. DDQG was utilized to address the MDP problem and find the best driving strategy by mapping the link between traffic photos and vehicle operations through CNN that solved the common drawback of rule-based RL methods deployed in intersection cases.…”
Section: Decision Makingmentioning
confidence: 99%
“…Besides, learning-based approaches have been widely used for autonomous decision making. In [21], an end-to-end decision-making framework is designed by using a convolutional neural network to map the relationship between traffic images and vehicle operations. In [22], a safe reinforcement learning (RL) approach is proposed to address the driving conflicts at autonomously navigate intersections, which shows good robustness against perception errors and occlusions.…”
Section: Introductionmentioning
confidence: 99%