“…This type of problem arises across a broad range of application areas (Conn et al, 2009; Audet & Hare, 2017), but has attracted particular recent attention in the learning community for problems such as black-box attacks (Chen et al, 2017; Ughi et al, 2019), hyperparameter tuning (Ghanbari & Scheinberg, 2017; Lakhmiri et al, 2020) and reinforcement learning (Mania et al, 2018; Choromanski et al, 2019). A current deficiency of DFO methods is their performance on large-scale problems, which is critical to their utility in machine learning; there have been several recent works aimed at improving the scalability of DFO (Bergou et al, 2019; Roberts, 2019; Porcelli & Toint, 2020; Cristofari & Rinaldi, 2020).…”