2021
DOI: 10.48550/arxiv.2104.08876
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning

Abstract: It is essential for an automated vehicle in the field to perform discretionary lane changes with appropriate roadmanship -driving safely and efficiently without annoying or endangering other road users -under a wide range of traffic cultures and driving conditions. While deep reinforcement learning methods have excelled in recent years and been applied to automated vehicle driving policy, there are concerns about their capability to quickly adapt to unseen traffic with new environment dynamics. We formulate th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 25 publications
0
2
0
Order By: Relevance
“…Optimization-based Meta-RL. This type of Meta-RLs separate learning into two loops: inner loop for policy adaptation, and outer loop for policy updates [13]. In most existing work, optimizations are done by computing gradient descents for both loops [3] [5].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Optimization-based Meta-RL. This type of Meta-RLs separate learning into two loops: inner loop for policy adaptation, and outer loop for policy updates [13]. In most existing work, optimizations are done by computing gradient descents for both loops [3] [5].…”
Section: Related Workmentioning
confidence: 99%
“…2) Self-driving: Highway Merge Fig. 3: Highway-Merging Scenario Meta-RL has been applied to solve decision-making problems in autonomous driving [13] [24] [25], but none of the references we had reviewed studied the prior safety, which is an important issue for AVs.…”
Section: A Experimental Setupmentioning
confidence: 99%