“…As discussed in the background, provided a reduction from solving common-payoff games to solving belief MDPs; independently, Dibangoye et al (2013b) and Oliehoek (2013) discovered similar reductions. These ideas have been leveraged in a large body of work in decentralized control literature (Lessard & Nayyar, 2013;Nayyar et al, 2014;Arabneydi & Mahajan, 2014;Ouyang et al, 2015;Vasconcelos & Martins, 2016;Tavafoghi et al, 2016;Afshari & Mahajan, 2018;Gagrani & Nayyar, 2018;Tavafoghi et al, 2018;Zhang et al, 2019;Gupta, 2021) and machine learning literature (Dibangoye et al, 2013a;MacDermed & Isbell, 2013;Dibangoye et al, 2014a;b;Dibangoye & Buffet, 2018;Foerster et al, 2019;Sokota et al, 2021;Fickinger et al, 2021;Sokota et al, 2022b;Kao et al, 2022). Use cases include game solving (Dibangoye et al, 2013b), expert iteration (Sokota et al, 2021), and decision-time planning Fickinger et al, 2021;Sokota et al, 2022b).…”