2022
DOI: 10.3233/aic-220113
|View full text |Cite
|
Sign up to set email alerts
|

Developing, evaluating and scaling learning agents in multi-agent environments

Abstract: The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and training 3-d humanoids in difficult team coordination tasks. A signature aim of our group is to use the resources and expertise made available to us at DeepMind in deep reinforcement learning to explore multi-agent systems in complex environments and use these benchmarks to a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 46 publications
0
2
0
Order By: Relevance
“…Coins [31,33,51,71] is a mixed-motive Markov game [54] played by n = 2 players (Fig. 2; see also Appendix A for full task details).…”
Section: Taskmentioning
confidence: 99%
“…Coins [31,33,51,71] is a mixed-motive Markov game [54] played by n = 2 players (Fig. 2; see also Appendix A for full task details).…”
Section: Taskmentioning
confidence: 99%
“…A few algorithms eschew training/testing but still cannot be considered fully decentralized since they require each player to be able to access the policies of other players (Foerster et al, 2018b; Jaques et al, 2019). Most algorithms in this class that can robustly find socially beneficial equilibria in collective action problems require public rewards (Eccles et al, 2019; Gemp et al, 2020; Hughes et al, 2018; McKee et al, 2020; Peysakhovich and Lerer, 2018; Wang et al, 2018) or the ability to redistribute rewards amongst agents’ (Lupu and Precup, 2020; Wang et al, 2021). This class of algorithms assumes that while they are learning all agents will have real-time access to one another’s rewards.…”
Section: Related Workmentioning
confidence: 99%
“…• DeepMind [7] • Five AI [9] • Heriot-Watt University [13] • King's College London [2] • Teesside University [8] • University of Aberdeen [3] • University of Edinburgh [1] • University of Essex [11] • University of Lancaster [4]…”
mentioning
confidence: 99%