“…In the constraint approach, the set of alternative models is represented as a hard constraint, and confidence in the nominal is captured by the size of this uncertainty set (see, e.g., Ben-Tal and Nemirovski 1998, 1999, 2000Bertsimas and Sim 2004;El Ghaoui and Lebret 1997;Iyengar 2005;Li and Kwon 2013;Nilim and El Ghaoui 2005;Wiesemann et al 2013). The penalty approach on the other hand expresses confidence in the nominal by penalizing alternative models that deviate too far from the nominal, and does so via a penalty function (soft constraint) that appears in the objective function (see, e.g., Dai Pra et al 1996, Peterson et al 2000, Hansen and Sargent 2007, Jain et al 2010, Kim and Lim 2015, Lim and Shanthikumar 2007. In this paper, we adopt an entropy penalty approach to represent model missspecification, because it provides a number of advantages from the perspective of characterizing the structure of the optimal robust control policy.…”