“…Designing controllers for the general constrained dynamics in ( 1) is challenging and, as a result, traditional online optimization models often adopt simplified versions of (1), such as unconstrained optimization [11,22,38] or time-invariant constraints [26,27] that are known a priori. More specifically, previous studies in online control and online optimization mostly focus on specific forms of constraints or costs depending on different applications, such as switching costs [17,33,35,43], ramping constraints [4,43], polytopic constraints [16], time-varying memoryless cumulative constraints [9,46,[48][49][50], convex loss functions with memory [1][2][3]42] or inventory constraints [30,35]. Within this literature, the goal is to derive policies with either small regret [9,15,33,35,49,51] or competitive ratio [4,17,34,42,43].…”