2022
DOI: 10.48550/arxiv.2206.00484
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems

Abstract: Muscle-actuated organisms are capable of learning an unparalleled diversity of dexterous movements despite their vast amount of muscles. Reinforcement learning (RL) on large musculoskeletal models, however, has not been able to show similar performance. We conjecture that ineffective exploration in large overactuated action spaces is a key problem. This is supported by the finding that common exploration noise strategies are inadequate in synthetic examples of overactuated systems. We identify differential ext… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…Further, human physiology is multiarticular, meaning that there exist many-to-one and one-to-many relationships between muscles and joints (e.g., the flexor digitorum profundus muscle controls over 3 joints for each digit of the hand) [27]. Critically, the human hand is also overactuated, meaning that there are more muscle forces than degrees of freedom (i.e., approximately 39 muscles collectively control 23 joints in the human hand) [49].…”
Section: Introductionmentioning
confidence: 99%
“…Further, human physiology is multiarticular, meaning that there exist many-to-one and one-to-many relationships between muscles and joints (e.g., the flexor digitorum profundus muscle controls over 3 joints for each digit of the hand) [27]. Critically, the human hand is also overactuated, meaning that there are more muscle forces than degrees of freedom (i.e., approximately 39 muscles collectively control 23 joints in the human hand) [49].…”
Section: Introductionmentioning
confidence: 99%