Proceedings of the Genetic and Evolutionary Computation Conference 2022
DOI: 10.1145/3512290.3528705
|View full text |Cite
|
Sign up to set email alerts
|

Approximating gradients for differentiable quality diversity in reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 23 publications
(16 citation statements)
references
References 25 publications
0
16
0
Order By: Relevance
“…Quality diversity for reinforcement learning (QD-RL). As defined in prior work [12], QD-RL is a special instance of QD in which φ parameterizes a reinforcement learning (RL) agent's policy π φ and the objective is the expected discounted return of the agent. QD-RL extends Markov Decision Processes (MDPs) [28] and is formulated as a tuple (S, U, p, r, γ, m).…”
Section: Problem Statementmentioning
confidence: 99%
See 3 more Smart Citations
“…Quality diversity for reinforcement learning (QD-RL). As defined in prior work [12], QD-RL is a special instance of QD in which φ parameterizes a reinforcement learning (RL) agent's policy π φ and the objective is the expected discounted return of the agent. QD-RL extends Markov Decision Processes (MDPs) [28] and is formulated as a tuple (S, U, p, r, γ, m).…”
Section: Problem Statementmentioning
confidence: 99%
“…For example, in locomotion, exploring corresponds to finding new controllers which use the robot's feet a different amount, while optimizing corresponds to making existing controllers walk faster. Prior work [12] shows that Fig. 1: We propose variants of the CMA-MAE algorithm which scale to high-dimensional controllers.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…6The following website serves as a database with research related to QD: https://qualitydiversity.github.io/ maintained by Antoine Cully convergence search for preserving the high-performing individuals within the novel niches [114]. Multi-dimensional Archive of Phenotypic Elites (MAP-Elites) is another algorithm in the QD family, and one that has gained considerable popularity in multiple areas such as games [115][116][117] and robotics [118,119]. As the other QD algorithms, MAP-Elites explores the behavioral space for a collection of solutions that are both highperforming and diverse among each other, with the caveat that MAP-Elites discretizes the behavior space as a grid of cells informed by a set of feature dimensions that illuminate the behavior space.…”
Section: Quality Diversitymentioning
confidence: 99%