2020
DOI: 10.48550/arxiv.2008.04589
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Model-Based Quality-Diversity Search for Efficient Robot Learning

Abstract: Despite recent progress in robot learning, it still remains a challenge to program a robot to deal with open-ended object manipulation tasks. One approach that was recently used to autonomously generate a repertoire of diverse skills is a novelty based Quality-Diversity (QD) algorithm. However, as most evolutionary algorithms, QD suffers from sampleinefficiency and, thus, it is challenging to apply it in real-world scenarios. This paper tackles this problem by integrating a neural network that predicts the beh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 19 publications
(31 reference statements)
0
5
0
Order By: Relevance
“…The first approach is to leverage the efficiency of other optimization methods such as evolution strategies [7,11,19] and policy-gradients [40,43]. The other line of work known as model-based qualitydiversity [21,28,34], reduces the number of evaluations required 1 will be made available after acceptance. through the use of surrogate models to provide a prediction of the BD and fitness.…”
Section: Related Work 21 Quality-diversitymentioning
confidence: 99%
“…The first approach is to leverage the efficiency of other optimization methods such as evolution strategies [7,11,19] and policy-gradients [40,43]. The other line of work known as model-based qualitydiversity [21,28,34], reduces the number of evaluations required 1 will be made available after acceptance. through the use of surrogate models to provide a prediction of the BD and fitness.…”
Section: Related Work 21 Quality-diversitymentioning
confidence: 99%
“…These methods have been used to generate repertoires of primitive actions for robot locomotion [23], [16] or simple manipulations with a robot arm [17]. The generated repertoires have already been used to bootstrap deep learning approaches [25], [26]. We propose to extend novelty-based repertoire generation methods with exploration of multiple behavior spaces such as in [27], [28] but with an additional focus on setups with very sparse interactions.…”
Section: Related Workmentioning
confidence: 99%
“…SAIL integrates surrogate models, in the form of Gaussian Process (GP) models, to approximate the objective function and reduce the number of evaluations for the computationally expensive application of aerodynamic design. Another algorithm called M-QD [18] later follows up on this idea and used neural network models that map the parameter space to the behaviour and fitness space as a surrogate model. They demonstrate this on robotic pushing and placing tasks.…”
Section: Model-based Quality-diversitymentioning
confidence: 99%