“…It manages the balance of exploration and exploitation with techniques such as UCT (Kocsis, Szepesvári, and Willemson 2006). Often combined with machine learning, it has been enormously successful in both games (Silver et al 2016;Gao, Müller, and Hayward 2018;Gao 2020;Saffidine 2008;Nijssen and Winands 2010) and non-game applications (Lu et al 2016;Mansley, Weinstein, and Littman 2011;Sabharwal, Samulowitz, and Reddy 2012;Cazenave 2010). In these applications, a perfect simulation model allows for efficient lookahead search.…”