2017
DOI: 10.3389/frobt.2017.00062
|View full text |Cite
|
Sign up to set email alerts
|

Evolutionary Policy Transfer and Search Methods for Boosting Behavior Quality: RoboCup Keep-Away Case Study

Abstract: This study evaluates various evolutionary search methods to direct neural controller evolution in company with policy (behavior) transfer across increasingly complex collective robotic (RoboCup keep-away) tasks. Robot behaviors are first evolved in a source task and then transferred for further evolution to more complex target tasks. Evolutionary search methods tested include objective-based search (fitness function), behavioral and genotypic diversity maintenance, and hybrids of such diversity maintenance and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(3 citation statements)
references
References 64 publications
0
3
0
Order By: Relevance
“…Another variation of incremental learning called transfer learning is used in a slightly different context, as the knowledge of the task learned formerly would be subsequently used in learning a related but different task. 18 These concepts have been widely studied in the context of RL, [19][20][21] neuro-evolution 22,23 and genetic programming. 24 There have also been investigations with multi-objective optimisation 25 using incremental learning architectures.…”
Section: Abstraction Learning Architecturesmentioning
confidence: 99%
“…Another variation of incremental learning called transfer learning is used in a slightly different context, as the knowledge of the task learned formerly would be subsequently used in learning a related but different task. 18 These concepts have been widely studied in the context of RL, [19][20][21] neuro-evolution 22,23 and genetic programming. 24 There have also been investigations with multi-objective optimisation 25 using incremental learning architectures.…”
Section: Abstraction Learning Architecturesmentioning
confidence: 99%
“…Keepaway is a subtask of RoboCup that was put forth as a testbed for machine learning in 2001 (Stone & Sutton, 2001). It has since been used for research on temporal difference reinforcement learning with function approximation (Stone, Sutton, & Kuhlmann, 2005), evolutionary learning (Pietro et al, 2002), relational reinforcement learning (Walker et al, 2004), behaviour transfer (Cheng et al, 2018;Didi & Nitschke, 2016a, 2016bNitschke & Didi, 2017;Schwab et al, 2018;, batch reinforcement learning (Riedmiller et al, 2009) and hierarchical reinforcement learning (Bai & Russell, 2017).…”
Section: Robocup Keepaway Soccermentioning
confidence: 99%
“…Keepaway is a subtask of RoboCup that was put forth as a testbed for machine learning in 2001 [17]. It has since been used for research on temporal difference reinforcement learning with function approximation [46], evolutionary learning [47], relational reinforcement learning [48], behaviour transfer [49,50,51,52,53,54,55], batch reinforcement learning [56] and hierarchical reinforcement learning [57].…”
Section: Robocup Keepaway Soccermentioning
confidence: 99%