2020
DOI: 10.48550/arxiv.2006.06455
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning Individually Inferred Communication for Multi-Agent Cooperation

Abstract: Communication lays the foundation for human cooperation. It is also crucial for multi-agent cooperation. However, existing work focuses on broadcast communication, which is not only impractical but also leads to information redundancy that could even impair the learning process. To tackle these difficulties, we propose Individually Inferred Communication (I2C), a simple yet effective model to enable agents to learn a prior for agent-agent communication. The prior knowledge is learned via causal inference and r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…A family of MARL frameworks exploit communication [15,[40][41][42] sharing mechanism allows an agent to concurrently receive other agents' base actions and the gradient backflows at the current step, which brings more timeliness to help agents make better decisions.…”
Section: Related Workmentioning
confidence: 99%
“…A family of MARL frameworks exploit communication [15,[40][41][42] sharing mechanism allows an agent to concurrently receive other agents' base actions and the gradient backflows at the current step, which brings more timeliness to help agents make better decisions.…”
Section: Related Workmentioning
confidence: 99%
“…[20] uses a bidirectional recurrent neural network for communication between multiple agents. [21] proposes a selectable point-to-point communication method, which is used to determine whether agents communicate with each other via constructing a belief vector. This kind of multi-agent algorithms needs to make a communication operation when agents are executed.…”
Section: Related Workmentioning
confidence: 99%