“…Furthermore, how to generate diverse strategies has been preliminarily studied in the reinforcement learning community. In specific, diverse strategies can be obtained in various ways, including adding some diversity regularization to the optimization objective (Abdullah et al, 2019), randomly searching in some diverse parameter space (Plappert et al, 2018;Fortunato et al, 2018), using information-based strategy proposal (Eysenbach et al, 2018;Gupta et al, 2018), and searching diverse strategies with evolutionary algorithms (Agapitos et al, 2008;Wang et al, 2019;Jaderberg et al, 2017;2019). More recently, researchers from DeepMind propose a league training paradigm to obtain a Grandmaster level StarCraft II AI (i.e., AlphaStar) by training a diverse league of continually adapting strategies and counter-strategies (Vinyals et al, 2019).…”