2019
DOI: 10.48550/arxiv.1907.04539
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Simple Kinematic Feedback Enhances Autonomous Learning in Bio-Inspired Tendon-Driven Systems

Abstract: Error feedback is known to improve performance by correcting control signals in response to perturbations. Here we show how adding simple error feedback can also accelerate and robustify autonomous learning in a tendondriven robot. We implemented two versions of the Generalto-Particular (G2P) autonomous learning algorithm to produce multiple movement tasks using a tendon-driven leg with two joints and three tendons: one with and one without kinematic feedback. As expected, feedback improved performance in simu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(9 citation statements)
references
References 27 publications
0
9
0
Order By: Relevance
“…In this paper, we studied how adding elastic elements affects autonomous learning in a two-joint three-tendons simulated limb (similar to [23], [25]) in the MuJoCo environment [32](Fig. 1.a).…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…In this paper, we studied how adding elastic elements affects autonomous learning in a two-joint three-tendons simulated limb (similar to [23], [25]) in the MuJoCo environment [32](Fig. 1.a).…”
Section: Methodsmentioning
confidence: 99%
“…G2P is a hierarchical autonomous learning algorithm that, on its lower-level, creates an inverse kinematics map using output kinematics collected from an initial random set of actuation commands (motor babbling). Systems that use an explicit kinematics model are, in general, easier to study and interpret, more data efficient and can generalize to a wider range of tasks; however, they can suffer from inaccuracies in the model especially during complex dynamical interactions (e.g., contact dynamics, injury to the body, or changes in the environment) [25], [23], [34], [35], [36], [37]. Systems that perform end-to-end learning (such as PPO), on the other hand, usually require larger number of samples to learn to perform a task, are harder to interpret due to their implicit modeling, and usually cannot generalize well across tasks [38], [39], [33], [40], [41].…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations