Simple Kinematic Feedback Enhances Autonomous Learning in Bio-Inspired Tendon-Driven Systems

Marjaninejad, Ali; Urbina-Meléndez, Darío; Valero-Cuevas, Francisco J.

doi:10.48550/arxiv.1907.04539

Cited by 1 publication

(9 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In this paper, we studied how adding elastic elements affects autonomous learning in a two-joint three-tendons simulated limb (similar to [23], [25]) in the MuJoCo environment [32](Fig. 1.a).…”

Section: Methodsmentioning

confidence: 99%

“…G2P is a hierarchical autonomous learning algorithm that, on its lower-level, creates an inverse kinematics map using output kinematics collected from an initial random set of actuation commands (motor babbling). Systems that use an explicit kinematics model are, in general, easier to study and interpret, more data efficient and can generalize to a wider range of tasks; however, they can suffer from inaccuracies in the model especially during complex dynamical interactions (e.g., contact dynamics, injury to the body, or changes in the environment) [25], [23], [34], [35], [36], [37]. Systems that perform end-to-end learning (such as PPO), on the other hand, usually require larger number of samples to learn to perform a task, are harder to interpret due to their implicit modeling, and usually cannot generalize well across tasks [38], [39], [33], [40], [41].…”

Section: Methodsmentioning

confidence: 99%

“…The recorded kinematics are joint angles, angular velocities, and angular accelerations for both joints (a vector of 6 values). Next, we trained a Multi-Layer Perceptron (MLP) Artificial Neural Network (ANN; one hidden layer with 15 neurons; trained for 20 epochs; 80% training 20% validation; loss function: MSE, optimizer: ADAM) with kinematics as input and activations as output to form the inverse kinematics map (similar to [23], [25]). Finally, this inverse map was used to control the system to perform two tasks: Cyclical and Point-to-point movements.…”

Section: A Simulated Experimentsmentioning

confidence: 99%

“…For each joint, we calculate the error as the Root Mean Square Error (RMSE) of the difference between the joint angle and the desired angle in Radians. We disregard the error for the first 25% of the signal to make sure any initial condition effect is washed out [23], [25].…”

Section: A Simulated Experimentsmentioning

confidence: 99%

“…Thus, it can be challenging to find solutions that satisfy all the constraints imposed by tendons and by task specifications at the same time [23], [20]. Moreover, the fact that their actuators are not directly operating on the degrees of freedom (as is the case in joint-driven systems), makes it challenging to use an off the shelf controller (such as a simple PID setup) without having access to dynamical equations of the system or a forward or inverse kinematics model [25]. Also, these tendon-driven systems often require accurate modeling and control strategies for applications such as animation of lifelike figures [26], control of anatomical limbs to understand neurological conditions [27], [28], [29], or functional electrical stimulation of limbs (e.g., [30] or [31]).…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Autonomous Control of a Tendon-driven Robotic Limb with Elastic Elements Reveals that Added Elasticity can Enhance Learning

Marjaninejad¹,

Tan²,

Valero-Cuevas³

2019

Preprint

Self Cite

View full text Add to dashboard Cite

Passive elastic elements can contribute to stability, energetic efficiency, and impact absorption in both biological and robotic systems. They also add dynamical complexity which makes them more challenging to model and control. The impact of this added complexity to autonomous learning has not been thoroughly explored. This is especially relevant to tendon-driven limbs whose cables and tendons are inevitably elastic. Here, we explored the efficacy of autonomous learning and control on a simulated bio-plausible tendon-driven leg across different tendon stiffness values. We demonstrate that increasing stiffness of the simulated muscles can require more iterations for the inverse map to converge but can then perform more accurately, especially in discrete tasks. Moreover, the system is robust to subsequent changes in muscle stiffnesses and can adapt on-the-go within 5 attempts. Lastly, we test the system for the functional task of locomotion, and found similar effects of muscle stiffness to learning and performance. Given that a range of stiffness values led to improved learning and maximized performance, we conclude the robot bodies and autonomous controllers-at least for tendon-driven systemscan be co-developed to take advantage of elastic elements. Importantly, this opens also the door to development efforts that recapitulate the beneficial aspects of the co-evolution of brains and bodies in vertebrates.

show abstract

Section: Methodsmentioning

confidence: 99%