2021
DOI: 10.48550/arxiv.2107.05627
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Hierarchical Neural Dynamic Policies

Abstract: We tackle the problem of generalization to unseen configurations for dynamic tasks in the real world while learning from high-dimensional image input. The family of nonlinear dynamical system-based methods have successfully demonstrated dynamic robot behaviors but have difficulty in generalizing to unseen configurations as well as learning from image inputs. Recent works approach this issue by using deep network policies and reparameterize actions to embed the structure of dynamical systems but still struggle … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 33 publications
0
4
0
Order By: Relevance
“…The task suite consists of two manipulation tasks that humans encounter every day, similar to those introduced in prior work [4], [25]. The tasks are pouring and scooping, excluding the easiest and hardest RB2 tasks (zipping and insertion).…”
Section: B Tasksmentioning
confidence: 99%
“…The task suite consists of two manipulation tasks that humans encounter every day, similar to those introduced in prior work [4], [25]. The tasks are pouring and scooping, excluding the easiest and hardest RB2 tasks (zipping and insertion).…”
Section: B Tasksmentioning
confidence: 99%
“…Physics values extracted/ derived from the environ-ment or system has been directly used by RL agents in form of physics parameters [113], dynamic movement (physics) primitives [3], physical state [59] and physical target [75]. For example in [75], the reward is created to meet two physical objectives/ targets: operation cost and self-energy sustainability.…”
Section: Physics Information (Types): Representation Of Physics Priorsmentioning
confidence: 99%
“…Extending this work to hierarchical deep policy learning framework, [3] introduced H-NDP which forms a curriculum by learning local dynamical system-based policies on small state-space region and then refines them into global dynamical system based policy. Given the accurate dynamics and constraint of the system [140] introduces control barrier certificates into actor-critic RL framework, for learning safe policies in dynamical systems.…”
Section: Augment Policy And/or Value N/wmentioning
confidence: 99%
“…These models are time-discrete, which leads to performance degradation with high frequencies [3]. Other research proposes continuous-time approaches that use a deep model to generate the parameters of movement primitive models [3], [2], but the fundamental movements are still implemented with a linear model.…”
Section: Introductionmentioning
confidence: 99%