2020
DOI: 10.1126/scirobotics.abc5986
|View full text |Cite
|
Sign up to set email alerts
|

Learning quadrupedal locomotion over challenging terrain

Abstract: Legged locomotion can extend the operational domain of robots to some of the most challenging environments on Earth. However, conventional controllers for legged locomotion are based on elaborate state machines that explicitly trigger the execution of motion primitives and reflexes. These designs have increased in complexity but fallen short of the generality and robustness of animal locomotion. Here, we present a robust controller for blind quadrupedal locomotion in challenging natural environments. Our appro… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
598
2
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 751 publications
(605 citation statements)
references
References 42 publications
4
598
2
1
Order By: Relevance
“…In one neurally inspired legged robot, a readout map was trained through reward delivery to transform sensory inputs into modulatory changes to a hierarchically lower dynamic oscillator, such that a simulated robot could complete climbs over difficult terrain (1). In other robots, RL is performed in simulation and the policy is then uploaded to a physical body to successfully generate locomotion (13, 24, 32, 43). RL for locomotion has also been successfully performed online with no simulation (18).…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…In one neurally inspired legged robot, a readout map was trained through reward delivery to transform sensory inputs into modulatory changes to a hierarchically lower dynamic oscillator, such that a simulated robot could complete climbs over difficult terrain (1). In other robots, RL is performed in simulation and the policy is then uploaded to a physical body to successfully generate locomotion (13, 24, 32, 43). RL for locomotion has also been successfully performed online with no simulation (18).…”
Section: Discussionmentioning
confidence: 99%
“…Every module, with the exception of the burst generators, independently and continuously seeks to match its perception with its goal value through corrective outputs in real time. Unlike designs that only implement feedback control at the level of joint control and some form of feed-forward computation above that to generate behavior (11,14, 22, 25, 28, 32, 43, 45, 50), our design uses closed loop negative feedback control at every level of the hierarchy. This feature replicates the purposive nature of animal behavior, which is often mistakenly assumed to be a feedforward process whereby a stimulus input is transformed by the nervous system and results in motor output (51).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In the past, several works have been proposed for the in-situ classification of terrain types, based exclusively on proprioceptive data while actually experiencing the traverse on the (potentially dangerous) terrain [ 88 , 89 ]. Among the works reported in the present survey, a recent and more frequently adopted approach consists of using proprioceptive data during training as well as for data labeling [ 34 , 35 , 66 ].…”
Section: Discussionmentioning
confidence: 99%
“…Though there are few challenges targeted towards learning for robotics, learning methods have been applied in a number of other robotics challenges. A learned locomotion controller for a quadrupedal robot was recently deployed in the DARPA Subterranean challenge [124], for example.…”
Section: Learning For Roboticsmentioning
confidence: 99%