2022
DOI: 10.1007/s10458-022-09548-8
|View full text |Cite
|
Sign up to set email alerts
|

Quantifying the effects of environment and population diversity in multi-agent reinforcement learning

Abstract: Generalization is a major challenge for multi-agent reinforcement learning. How well does an agent perform when placed in novel environments and in interactions with new co-players? In this paper, we investigate and quantify the relationship between generalization and diversity in the multi-agent domain. Across the range of multi-agent environments considered here, procedurally generating training levels significantly improves agent performance on held-out levels. However, agent performance on the specific lev… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(11 citation statements)
references
References 16 publications
0
11
0
Order By: Relevance
“…Without adequate diversity, the population may converge prematurely to suboptimal solutions, leading to the stagnation of the learning process. Overfitting to policies in the policy zoo is a significant challenge to generalization [25,59]. Although the diversity of a population has been widely discussed in the evolutionary algorithm community at the genotype level, phenotype level, and the combination of the previous two cases [60], which typically operate on a fixed set of candidate solutions, PB-DRL is often used in dynamic and uncertain environments where the population size and diversity can change over time.…”
Section: Challengesmentioning
confidence: 99%
“…Without adequate diversity, the population may converge prematurely to suboptimal solutions, leading to the stagnation of the learning process. Overfitting to policies in the policy zoo is a significant challenge to generalization [25,59]. Although the diversity of a population has been widely discussed in the evolutionary algorithm community at the genotype level, phenotype level, and the combination of the previous two cases [60], which typically operate on a fixed set of candidate solutions, PB-DRL is often used in dynamic and uncertain environments where the population size and diversity can change over time.…”
Section: Challengesmentioning
confidence: 99%
“…This problem is exacerbated in MARL as agents tend to overfit to their co-players [23]. A interesting approach to address this problem is to incorporate diversity when learning a control policy [28]. Diversity approaches can be broadly classified into two categories: environmental diversity [19,28] and policy diversity [12,25,26].…”
Section: Diversity In Rlmentioning
confidence: 99%
“…A interesting approach to address this problem is to incorporate diversity when learning a control policy [28]. Diversity approaches can be broadly classified into two categories: environmental diversity [19,28] and policy diversity [12,25,26]. In environmental diversity, variants of the given environment are used to train the agents.…”
Section: Diversity In Rlmentioning
confidence: 99%
See 2 more Smart Citations