2020
DOI: 10.48550/arxiv.2004.00784
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning Agile Robotic Locomotion Skills by Imitating Animals

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
114
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 51 publications
(114 citation statements)
references
References 0 publications
0
114
0
Order By: Relevance
“…Demonstrations have long been used in dealing with exploration issues in reinforcement learning for robotic tasks [11], [12], [13]. Recently, demonstrations have been used as one of the main ingredients to learn complicated locomotion behaviors in a DRL setting [14], [15]. In these works, a policy is trained using reinforcement learning to reproduce a given demonstration trajectory as well as possible in simulation.…”
Section: A Related Workmentioning
confidence: 99%
“…Demonstrations have long been used in dealing with exploration issues in reinforcement learning for robotic tasks [11], [12], [13]. Recently, demonstrations have been used as one of the main ingredients to learn complicated locomotion behaviors in a DRL setting [14], [15]. In these works, a policy is trained using reinforcement learning to reproduce a given demonstration trajectory as well as possible in simulation.…”
Section: A Related Workmentioning
confidence: 99%
“…Our work is inspired by recent methods for animated character control via RL-based imitation of motion capture data [23]- [25]. Motion capture data has also been used to transfer dog behaviors to a real quadruped platform [10] or to provide adversarially learned motion priors [26]. Holden et al [27] use terrains that were fitted to existing motion capture trajectories for kinematic replay.…”
Section: Character Animation and Mocap Trackingmentioning
confidence: 99%
“…All units are centimeters. The spaces separating each pair of steps laterally were sampled from (10,15), the vertical offset and longitudinal offset of each step from (−5, 5), the length of the gaps in the longitudinal direction from (15,20) and the length and the width of the steps of each pair from (20, 40) and (30, 60) respectively. The steps were 72 cm high and we applied random rotations on the pitch and roll axes with values sampled from (−0.15, 0.15) radians.…”
Section: Wavy Stepsmentioning
confidence: 99%
See 1 more Smart Citation
“…The capability of blind walking, i.e., by using propioceptive feedback only, has achieved remarkable performance using either mixed-integer nonlinear optimization [1], [2] and mathematical model-based control [3]- [6], or modelfree learning which heavily utilizes physics simulations to train control policies by reinforcement learning [7]- [12] or imitation learning [13]- [15].…”
Section: Introductionmentioning
confidence: 99%