2024
DOI: 10.1109/tase.2023.3254583
|View full text |Cite
|
Sign up to set email alerts
|

Proximal Policy Optimization With Time-Varying Muscle Synergy for the Control of an Upper Limb Musculoskeletal System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…Another study [30] developed a deep Q networkbased controller for a musculoskeletal arm model, employing a phased target-learning framework and human-inspired noise for stable and efficient exploration of the solution space. Complex reinforcement learning algorithms such as Deep Deterministic Policy Gradient [67], proximal policy optimisation [68], and Soft Actor-Critics [85] have also shown promising control results.…”
Section: Reinforcement Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…Another study [30] developed a deep Q networkbased controller for a musculoskeletal arm model, employing a phased target-learning framework and human-inspired noise for stable and efficient exploration of the solution space. Complex reinforcement learning algorithms such as Deep Deterministic Policy Gradient [67], proximal policy optimisation [68], and Soft Actor-Critics [85] have also shown promising control results.…”
Section: Reinforcement Learningmentioning
confidence: 99%
“…The only prior work that undergoes this retraining process is constrained to joint space control [65]. While reinforcement learning approaches have proven successful in highly redundant musculoskeletal systems [66][67][68], they often mandate complex reward function design and laborious retraining after muscle failure.…”
Section: Introductionmentioning
confidence: 99%