Tackling Real-World Autonomous Driving using Deep Reinforcement Learning

Maramotti, Paolo; Capasso, Alessandro Paolo; Bacchiani, Giulio; Broggi, Alberto

doi:10.1109/iv51971.2022.9827302

Cited by 8 publications

(4 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…RL enables the systems to learn from the data obtained from the environment to make the correct decision. RL algorithms are used for the decision making and maneuver execution systems like lane change and keeping [76][77][78][79][80][81], overtaking maneuvers [82], intersection and roundabout handling [83][84]. According to [76], there are two crucial components of autonomous driving systems -planning and control systems.…”

Section: Autonomous Drivingmentioning

confidence: 99%

“…RL algorithms are used for the decision making and maneuver execution systems like lane change and keeping [76][77][78][79][80][81], overtaking maneuvers [82], intersection and roundabout handling [83][84]. According to [76], there are two crucial components of autonomous driving systems -planning and control systems. The planning systems predict the path the self-driving car should take while the control systems are responsible for low-level actions like controlling steering angles, throttle, and break.…”

Section: Autonomous Drivingmentioning

confidence: 99%

See 1 more Smart Citation

Reinforcement Learning Review: Past Acts, Present Facts and Future Prospects

Kommey,

Isaac,

Tamakloe

et al. 2024

ITJRD

View full text Add to dashboard Cite

Reinforcement Learning (RL) is fast gaining traction as a major branch of machine learning, its applications have expanded well beyond its typical usage in games. Several subfields of reinforcement learning like deep reinforcement learning and multi-agent reinforcement learning are also expanding rapidly. This paper provides an extensive review on the field from the point of view of Machine Learning (ML). It begins by providing a historical perspective on the field then proceeds to lay a theoretical background on the field. It further discusses core reinforcement learning problems and approaches taken by different subfields before discussing the state of the art in the field. An inexhaustive list of applications of reinforcement learning is provided and their practicability and scalability assessed. The paper concludes by highlighting some open areas or issues in the field

show abstract

Section: Autonomous Drivingmentioning

confidence: 99%

Section: Autonomous Drivingmentioning

confidence: 99%

Reinforcement Learning Review: Past Acts, Present Facts and Future Prospects

Kommey,

Isaac,

Tamakloe

et al. 2024

ITJRD

View full text Add to dashboard Cite

show abstract

“…The development of RL in the field of autonomous driving has transitioned from foundational models to advanced algorithms capable of addressing complex and dynamic driving tasks. Early RL methods focused on simple control tasks [10], laying the foundation for more complex methods such as Deep Q-Network (DQN) for higher-dimensional state and action spaces [11], Deterministic Policy Gradient (DDPG) [12,13], Proximal Policy Optimization (PPO) [14], Trust Region Policy Optimization (TRPO) [14], and Asynchronous Advantage Actor Critic (A3C) [15], among others [16]. These methods have been used in the field of autonomous driving and have demonstrated good performance [15,17,18].…”

Section: Introductionmentioning

confidence: 99%

“…Early RL methods focused on simple control tasks [10], laying the foundation for more complex methods such as Deep Q-Network (DQN) for higher-dimensional state and action spaces [11], Deterministic Policy Gradient (DDPG) [12,13], Proximal Policy Optimization (PPO) [14], Trust Region Policy Optimization (TRPO) [14], and Asynchronous Advantage Actor Critic (A3C) [15], among others [16]. These methods have been used in the field of autonomous driving and have demonstrated good performance [15,17,18]. However, despite the remarkable performance of autonomous driving products showcased by companies such as Waymo, Baidu Apollo, and others in regular traffic, their safety reports have documented numerous emergency takeover incidents when faced with unknown or complex situations.…”

Section: Introductionmentioning

confidence: 99%

Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning

Zhao,

Chen,

Fan

et al. 2024

Sensors

View full text Add to dashboard Cite

Reinforcement Learning (RL) methods are regarded as effective for designing autonomous driving policies. However, even when RL policies are trained to convergence, ensuring their robust safety remains a challenge, particularly in long-tail data. Therefore, decision-making based on RL must adequately consider potential variations in data distribution. This paper presents a framework for highway autonomous driving decisions that prioritizes both safety and robustness. Utilizing the proposed Replay Buffer Constrained Policy Optimization (RECPO) method, this framework updates RL strategies to maximize rewards while ensuring that the policies always remain within safety constraints. We incorporate importance sampling techniques to collect and store data in a Replay buffer during agent operation, allowing the reutilization of data from old policies for training new policy models, thus mitigating potential catastrophic forgetting. Additionally, we transform the highway autonomous driving decision problem into a Constrained Markov Decision Process (CMDP) and apply our proposed RECPO for training, optimizing highway driving policies. Finally, we deploy our method in the CARLA simulation environment and compare its performance in typical highway scenarios against traditional CPO, current advanced strategies based on Deep Deterministic Policy Gradient (DDPG), and IDM + MOBIL (Intelligent Driver Model and the model for minimizing overall braking induced by lane changes). The results show that our framework significantly enhances model convergence speed, safety, and decision-making stability, achieving a zero-collision rate in highway autonomous driving.

show abstract