“…In recent years, data-driven sequential decision-making has received a lot of attentions and finds a wide range of applications in operations management, such as dynamic inventory control (see, e.g., Huh et al (2011), Chen and Plambeck (2008), Chen et al (2019b,a), Lei et al (2019)), dynamic pricing (see, e.g., Zeevi (2009, 2015), Wang et al (2014), Chen et al (2019c), Broder and Rusmevichientong (2012)), dynamic assortment optimization (see, e.g., Rusmevichientong and Topaloglu (2012), Saure and Zeevi (2013), Agrawal et al (2019), Wang et al (2018), Chen et al (2018)). Take the personalized/contextual dynamic pricing as an example; it is usually assumed that the underlying demand, which is a function of the price and customer's contextual information, follows a certain probabilistic model with unknown parameters.…”