“…In the non-stationary case, the rewards {R t (i)} i∈{1,...,N } are drawn from a set of unknown and non-stationary distribution p i (t) that may vary over time. ese variations can be continuous (Slivkins & Upfal, 2008) or abrupt (Garivier & Moulines, 2011;Besbes, Gur, & Zeevi, 2014;Raj & Kalyani, 2017). In this work, we'll consider only abruptly changing environments: the distribution of rewards remains stationary for a given period and a change occurs at an unknown time instants (breakpoints).…”