2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561814
|View full text |Cite
|
Sign up to set email alerts
|

Sim-to-Real Learning of All Common Bipedal Gaits via Periodic Reward Composition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
70
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 101 publications
(70 citation statements)
references
References 17 publications
0
70
0
Order By: Relevance
“…These complex behaviors can be composed of a single framework using pre-trained expert networks and a gate neural network, and manifest agile and effective motions [7] on the real robot. Furthermore, RL can be utilized for bipedal robots to climb up stairs [8] or to display diverse locomotion patterns such as standing, walking, and running [9].…”
Section: Introductionmentioning
confidence: 99%
“…These complex behaviors can be composed of a single framework using pre-trained expert networks and a gate neural network, and manifest agile and effective motions [7] on the real robot. Furthermore, RL can be utilized for bipedal robots to climb up stairs [8] or to display diverse locomotion patterns such as standing, walking, and running [9].…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning (RL) based approaches have begun to show significant promise at robust real-world legged locomotion [1,12,13]. Unlike optimization or heuristic-based control methods which rely on prescribed ground contact schedules or force-based event detection, RL can produce control policies which learn proprioceptive reflexes and strategies for dealing with unexpectedly early or late contact and rough terrain through exposure to a variety of disturbances during training.…”
Section: Learned Controllermentioning
confidence: 99%
“…In the remainder of this section, we detail the specific simto-real RL formulation used in this work, which follows recent work [13] on learning different biped gaits over flat terrain. Surprisingly, only minimal changes were required to enable policy learning for the much more complex stair-like terrains of this paper 1 In particular, the only major modification required was the randomized domain generation of stair-like rather than mostly flat terrain as discussed later in Section III; no novel stair-specific reward terms were needed.…”
Section: Reinforcement Learning Formulationmentioning
confidence: 99%
See 2 more Smart Citations