“…End2End control papers mainly employ either deep neural networks trained offline on real‐world and/or synthetic data (Bechtel et al.; Bojarski et al.; C. Chen, Seff, Kornhauser, & Xiao; Eraqi et al.; Fridman et al.; Hecker et al.; Rausch et al.; Xu et al.; S. Yang et al.), or DRL systems trained and evaluated in simulation (Jaritz et al.; Perot, Jaritz, Toromanoff, & Charette; Sallab et al., 2017b). Methods for porting simulation‐trained DRL models to real‐world driving have also been reported (Wayve, 2018), as have DRL systems trained directly on real‐world image data (Pan et al.).…”