2020
DOI: 10.1609/aaai.v34i05.6216
|View full text |Cite
|
Sign up to set email alerts
|

Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence

Abstract: Learning agents that are not only capable of taking tests, but also innovating is becoming a hot topic in AI. One of the most promising paths towards this vision is multi-agent learning, where agents act as the environment for each other, and improving each agent means proposing new problems for others. However, existing evaluation platforms are either not compatible with multi-agent settings, or limited to a specific game. That is, there is not yet a general evaluation platform for research on multi-agent int… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 18 publications
0
6
0
Order By: Relevance
“…The objective for each agent is to maximize the expected cumulative reward received during the game. For a cooperative POSG, we quote the definition in Song et al (2020),…”
Section: Preliminariesmentioning
confidence: 99%
“…The objective for each agent is to maximize the expected cumulative reward received during the game. For a cooperative POSG, we quote the definition in Song et al (2020),…”
Section: Preliminariesmentioning
confidence: 99%
“…Several papers Song et al (2020b); Lowe et al (2017) have introduced MARL benchmarks for specific domains, but do not measure generalisation and don't use learning agents to produce evaluation tasks. Another approach with a long history in game theory involves organizing a competition between strategies submitted by different research groups (e.g.…”
Section: Related Workmentioning
confidence: 99%
“…The objective for each agent is to maximize the expected cumulative reward received during the game. For a cooperative POSG, we quote the definition in Song et al [42],…”
Section: Preliminariesmentioning
confidence: 99%