“…Cooperative Multi-Agent Reinforcement Learning (MARL) methods have addressed numerous challenges in both virtual and real-world scenarios, such as traffic signal control [24,17], automated freight handling [6], and autonomous driving [29,28]. Cooperative MARL * Corresponding Author.…”