2021
DOI: 10.48550/arxiv.2108.05030
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DQ-GAT: Towards Safe and Efficient Autonomous Driving with Deep Q-Learning and Graph Attention Networks

Abstract: Autonomous driving in multi-agent and dynamic traffic scenarios is challenging, where the behaviors of other road agents are uncertain and hard to model explicitly, and the egovehicle should apply complicated negotiation skills with them to achieve both safe and efficient driving in various settings, such as giving way, merging and taking turns. Traditional planning methods are largely rule-based and scale poorly in these complex dynamic scenarios, often leading to reactive or even overly conservative behavior… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 29 publications
0
3
0
Order By: Relevance
“…Conditional imitation learning [2] is the most widely used framework, which learns to map raw sensor data, such as LIDAR data [15] and image data [5], to output control signals directly. To improve the generalization performance, semantic information (such as BEVs) [7], [8] is widely used as input of the end-to-end framework.…”
Section: A Learning Based Navigationmentioning
confidence: 99%
See 2 more Smart Citations
“…Conditional imitation learning [2] is the most widely used framework, which learns to map raw sensor data, such as LIDAR data [15] and image data [5], to output control signals directly. To improve the generalization performance, semantic information (such as BEVs) [7], [8] is widely used as input of the end-to-end framework.…”
Section: A Learning Based Navigationmentioning
confidence: 99%
“…Our system input consists of three parts {B, C , N }, where B is the historical trajectory information of all vehicles in a local observation range, C is the scenarios information and N is the desired route information of ego-vehicle. Similar to [7], [8], the BEV information is considered. More specifically, we consider the historical trajectory information of n vehicles surround the ego vehicle, {x t i , y t i , vx t i , vy t i , θ t i }, where x t i and y t i denote the relative position to the ego vehicle in lateral and longitudinal directions respectively, vx t i and vy t i denote the relative velocity information, and θ t i denotes the relative heading.…”
Section: Preliminariesmentioning
confidence: 99%
See 1 more Smart Citation