“…Delays in multi-arm bandit (MAB). Delays were extensively studied in MAB and optimization both in the stochastic setting (Agarwal & Duchi, 2012;Vernade et al, 2017;Pike-Burke et al, 2018;Cesa-Bianchi et al, 2018;Zhou et al, 2019;Gael et al, 2020;Lancewicki et al, 2021;Cohen et al, 2021a), and the adversarial setting (Quanrud & Khashabi, 2015;Cesa-Bianchi et al, 2016;Thune et al, 2019;Bistritz et al, 2019;Zimmert & Seldin, 2020;Ito et al, 2020;Gyorgy & Joulani, 2021;van der Hoeven & Cesa-Bianchi, 2021).…”