2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021
DOI: 10.1109/iros51168.2021.9636467
|View full text |Cite
|
Sign up to set email alerts
|

Robust Feedback Motion Policy Design Using Reinforcement Learning on a 3D Digit Bipedal Robot

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
27
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 53 publications
(27 citation statements)
references
References 17 publications
0
27
0
Order By: Relevance
“…It can even prevent learning anything if the variability is not increased progressively [10]. Alternatively, the stability can be improved by predicting high-level features for a modelbased controller [32], but it bounds the overall performance. Besides, a memory network or a history of previous timesteps is often used to thwart partial observability of the state [10], [31].…”
Section: Simulation To Real World Transfermentioning
confidence: 99%
“…It can even prevent learning anything if the variability is not increased progressively [10]. Alternatively, the stability can be improved by predicting high-level features for a modelbased controller [32], but it bounds the overall performance. Besides, a memory network or a history of previous timesteps is often used to thwart partial observability of the state [10], [31].…”
Section: Simulation To Real World Transfermentioning
confidence: 99%
“…The gait controller is responsible for keeping track of the gait parameters and tracking the generated foot trajectory. Based on the contact state of the leg (estimated as explained in Section II-A.3), the gait controller augments the ankle regulation followed by joint level PD tracking for the swing leg and just the torso regulation for the stance leg, as in [13]. A phase-variable τ, τ ∈ [0, 1) which is used to track the semi-elliptical trajectory gets reset once every walking step or upon a premature foot contact.…”
Section: Gait Controllermentioning
confidence: 99%
“…The swing foot is kept parallel to the underlying terrain elevation to ensure proper landing on the ground. The desired position of the swing foot is determined from the kinematics of the robot's leg [13] as follows: where q d tr and q d tp are the target angles for the ankle roll and pitch joints. The value of S f is defined as follows, S f = −1 left leg in swing phase +1 right leg in swing phase Torso Regulation: The torso regulation is applied to ensure an upright torso, which is desired for a stable walking gait and, more importantly, to prevent the stance leg from sliding.…”
Section: Gait Controllermentioning
confidence: 99%
See 2 more Smart Citations