Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022
DOI: 10.24963/ijcai.2022/82
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Multi-Agent Communication via Shapley Message Value

Abstract: Utilizing messages from teammates is crucial in cooperative multi-agent tasks due to the partially observable nature of the environment. Naively asking messages from all teammates without pruning may confuse individual agents, hindering the learning process and impairing the whole system's performance. Most previous work either utilizes a gate or employs an attention mechanism to extract relatively important messages. However, they do not explicitly evaluate each message's value, failing to learn an efficient … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…We subjected T2MAC to rigorous testing across various MARL environments, such as Hallway, MPE, and SMAC. Compared to prominent multi-agent communication strategies like TarMac (Das et al 2019), MAIC (Yuan et al 2022), SMS (Xue et al 2022), and MASIA (Guan et al 2022), T2MAC consistently excelled in both performance and efficiency. Additionally, its versatility shone through across diverse scenarios.…”
Section: Introductionmentioning
confidence: 95%
See 1 more Smart Citation
“…We subjected T2MAC to rigorous testing across various MARL environments, such as Hallway, MPE, and SMAC. Compared to prominent multi-agent communication strategies like TarMac (Das et al 2019), MAIC (Yuan et al 2022), SMS (Xue et al 2022), and MASIA (Guan et al 2022), T2MAC consistently excelled in both performance and efficiency. Additionally, its versatility shone through across diverse scenarios.…”
Section: Introductionmentioning
confidence: 95%
“…Advancing this idea, (Kim et al 2019;Mao et al 2019;Wang et al 2020a;Sun et al 2021) implement a weight-based scheduler, prioritizing agents holding vital observations. Enriching this approach, I2C (Ding, Huang, and Lu 2020), MAGIC (Niu, Paleja, and Gombolay 2021), and SMS (Xue et al 2022) harness methods like causal inference, graph-attention, and Shapley message value to pinpoint ideal communication recipients.…”
Section: Related Workmentioning
confidence: 99%
“…Communication plays a promising role in multi-agent coordination under partial observability [13,84]. Extensive research have been made on learning communication protocols to improve performance on cooperative tasks [20,15,34,73,12,37,77,71,21]. Previous works can be divided into two categories.…”
Section: Related Workmentioning
confidence: 99%
“…Robustness in cooperative MARL Previous cooperative MARL [32] works either concentrate on improving coordination ability from diverse aspects like scalability [33], credit assignment [34], and non-stationarity [35], or applying the cooperative MARL technique to multiple domains like autonomous vehicle teams [36,37], power management [38], and dynamic algorithm configuration [39]. Those approaches ignore the robustness of the learned policy when encountering uncertainties, perturbations, or structural changes in the environment [10], hastening the robustness test in the MARL [40].…”
Section: Related Workmentioning
confidence: 99%
“…We design two instances, which are respectively GP-4r and GP-9r. In GP-4r, there are 4 (2×2) regions, of which each region is a field with size of [3,3], and three agents existing in the map, while, in GP-9r, there exist 9 (3×3) regions and 3 agents.…”
Section: • Gold Panner (Gp)mentioning
confidence: 99%