“…While a lot of work has been done in the full information i.i.d. statistical setting [Baxter, 2000, Khodak et al, 2019, Maurer and Pontil, 2013, Maurer et al, 2016, Denevi et al, 2018, Tripuraneni et al, 2021, Boursier et al, 2022], investigations of meta-learning in the partial information interactive setting are lacking, and we are aware of only a few recent works [Cella et al, 2020, Yang et al, 2022]. More work has been done on the multitask bandit setting [Kveton et al, 2021, Hu et al, 2021, Yang et al, 2020, Cella and Pontil, 2021]; however, the goal there is different, in that one wishes to learn a prescribed finite set of tasks well, as opposed to leveraging knowledge from them to learn a novel downstream task.…”