“…Methods in AutoRL can be placed on a spectrum of automation, where on one end would be methods to select pipelines and on the other would be methods that try to discover new algorithms groundup in a data-driven manner (Oh et al, 2020). Techniques from the Automated Machine Learning literature (Hutter et al, 2019) then transfer to the RL setting, including algorithm selection (Laroche & Feraud, 2022), hyperparameter optimization (Li et al, 2019;Parker-Holder et al, 2020;Wan et al, 2022), dynamic configurations (Adriaensen et al, 2022), learned optimizers , and neural architecture search (Wan et al, 2022). Similarly, techniques from the Evolutionary optimization and Meta-Learning literature naturally transfer to this setting with methods aiming to meta-learn parts of the RL pipeline such as update rules (Oh et al, 2020), loss functions (Salimans et al, 2017;Kirsch et al, 2020), symbolic representations of algorithms (Alet et al, 2020;Co-Reyes et al, 2021;Luis et al, 2022), or concept drift (Lu et al, 2022).…”