Proceedings of the Genetic and Evolutionary Computation Conference Companion 2021
DOI: 10.1145/3449726.3463171
|View full text |Cite
|
Sign up to set email alerts
|

Behavior-based neuroevolutionary training in reinforcement learning

Abstract: In addition to their undisputed success in solving classical optimization problems, neuroevolutionary and population-based algorithms have become an alternative to standard reinforcement learning methods. However, evolutionary methods often lack the sample efficiency of standard value-based methods that leverage gathered state and value experience. If reinforcement learning for real-world problems with significant resource cost is considered, sample efficiency is essential. The enhancement of evolutionary algo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 29 publications
0
1
0
Order By: Relevance
“…The bnet algorithm (Stork et al, 2021) is borderline in this survey as it does not truly uses an RL algorithm, but uses a Behavior-Based Neuroevolution (BBNE) mechanism which is only loosely inspired from RL algorithms, without relying on gradient descent. bnet combines a robust selection method based on standard fitness, a second mechanism based on the advantage of the behavior of an agent, and a third mechanism based on a surrogate estimate of the return of policies.…”
Section: Deep Rl Actor Injectionmentioning
confidence: 99%
“…The bnet algorithm (Stork et al, 2021) is borderline in this survey as it does not truly uses an RL algorithm, but uses a Behavior-Based Neuroevolution (BBNE) mechanism which is only loosely inspired from RL algorithms, without relying on gradient descent. bnet combines a robust selection method based on standard fitness, a second mechanism based on the advantage of the behavior of an agent, and a third mechanism based on a surrogate estimate of the return of policies.…”
Section: Deep Rl Actor Injectionmentioning
confidence: 99%