2013 IEEE Third Joint International Conference on Development and Learning and Epigenetic Robotics (ICDL) 2013
DOI: 10.1109/devlrn.2013.6652526
|View full text |Cite
|
Sign up to set email alerts
|

Goal babbling with unknown ranges: A direction-sampling approach

Abstract: Goal babbling is a recent concept for the efficient bootstrapping of sensorimotor coordination that is inspired by infants' early goal-directed movement attempts. Several studies have shown its superior performance compared to random motor babbling. Yet, previous implementations of goal babbling require knowledge of a set of achievable goals in advance. This paper introduces an approach to goal babbling that can bootstrap coordination skills without pre-specifying, or even representing, a set of goals. On the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 18 publications
(21 citation statements)
references
References 19 publications
0
21
0
Order By: Relevance
“…We design the scalar reward dependent upon the Euclidean distance from the goal such that it has a high negative value for distances further away from the goal that progressively but discretely decreases towards target, where it receives a reward of 0. The motivation behind this reward structure is to enable the robotic arm to make goal-directed attempts [22]. The episode will end either when the goal is reached or the maximum number of trials per episode are reached.…”
Section: B Reward-guided Actor-critic Architecturementioning
confidence: 99%
“…We design the scalar reward dependent upon the Euclidean distance from the goal such that it has a high negative value for distances further away from the goal that progressively but discretely decreases towards target, where it receives a reward of 0. The motivation behind this reward structure is to enable the robotic arm to make goal-directed attempts [22]. The episode will end either when the goal is reached or the maximum number of trials per episode are reached.…”
Section: B Reward-guided Actor-critic Architecturementioning
confidence: 99%
“…Many other exploration strategies could be easily integrated into the library, as for example Direction Sampling [7], compression progress [4], empowerment [8] and thus be compared in a proper way on various sensorimotor systems and using various sensorimotor internal models.…”
Section: Discussionmentioning
confidence: 99%
“…The underlying algorithm for the exploration is goal babbling, a method for bootstrapping an inverse model for a motor coordination task [7], [23], [24], [8]. Goal babbling operates in the space of outcomes.…”
Section: B Goal-directed Explorationmentioning
confidence: 99%