2023
DOI: 10.3390/biomimetics8050382
|View full text |Cite
|
Sign up to set email alerts
|

A Hierarchical Framework for Quadruped Robots Gait Planning Based on DDPG

Yanbiao Li,
Zhao Chen,
Chentao Wu
et al.

Abstract: In recent years, significant progress has been made in employing reinforcement learning for controlling legged robots. However, a major challenge arises with quadruped robots due to their continuous states and vast action space, making optimal control using simple reinforcement learning controllers particularly challenging. This paper introduces a hierarchical reinforcement learning framework based on the Deep Deterministic Policy Gradient (DDPG) algorithm to achieve optimal motion control for quadruped robots… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 21 publications
0
4
0
Order By: Relevance
“…Experimental results using the BHR7P bipedal robot validate the effectiveness of these proposed methods. The paper by Yanbiao [4] presents a hierarchical reinforcement learning framework based on the Deep Deterministic Policy Gradient (DDPG) algorithm. This framework involves a high-level planner that generates ideal motion parameters, a lowlevel controller using model predictive control (MPC), and a trajectory generator.…”
Section: Discussion Of the Papersmentioning
confidence: 99%
“…Experimental results using the BHR7P bipedal robot validate the effectiveness of these proposed methods. The paper by Yanbiao [4] presents a hierarchical reinforcement learning framework based on the Deep Deterministic Policy Gradient (DDPG) algorithm. This framework involves a high-level planner that generates ideal motion parameters, a lowlevel controller using model predictive control (MPC), and a trajectory generator.…”
Section: Discussion Of the Papersmentioning
confidence: 99%
“…It can be adept at dealing with the high-dimensional action spaces commonly encountered in real-world applications like robotics, autonomous vehicles, and complex control systems. The soft target updates employed by DDPG also contribute to more robust and reliable learning, including controlling a car's throttle [25] or a robot's joint angles [26]. Furthermore, the improved version of DDPG has been proposed to enhance learning efficiency.…”
Section: Introductionmentioning
confidence: 99%
“…To enhance the adaptability of these learning strategies while reducing their computational demands, current research trends towards integrating reinforcement learning with conventional control methods [ 17 ]. For example, Li and colleagues developed a hierarchical framework for quadruped robots’ gait planning that combines the DDPG algorithm with Model Predictive Control (MPC), achieving optimal action control [ 18 ]. Moreover, since the introduction of the soft actor-critic (SAC) algorithm, reinforcement learning-based gait control methods have made significant strides in the field of robotics [ 19 ].…”
Section: Introductionmentioning
confidence: 99%