2014
DOI: 10.1142/s0219843614500121
|View full text |Cite
|
Sign up to set email alerts
|

Full-Body Postural Control of a Humanoid Robot with Both Imitation Learning and Skill Innovation

Abstract: In this paper, we present a novel methodology to obtain imitative and innovative postural movements in a humanoid based on human demonstrations in a di®erent kinematic scale. We collected motion data from a group of human participants standing up from a chair. Modeling the human as an actuated 3-link kinematic chain, and by de¯ning a multiobjective reward function of zero moment point and joint torques to represent the stability and e®ort, we computed reward pro¯les for each demonstration. Since individual rew… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 48 publications
0
10
0
Order By: Relevance
“…Furthermore, some recent studies suggested that the main difference between humans and chimpanzees is the ability of over-imitation [3]. Our proposal of using a reward profile to solve the correspondence problem in order to transfer a complex behavior from a human to a humanoid is based on these previous neuroscience works and previous experiments performed in a real humanoid standing up from a chair [17]. However, we are not sure of what is the internal objective function that the brain is optimizing Figure 10.…”
Section: Discussion On the Reward Profilementioning
confidence: 94%
See 4 more Smart Citations
“…Furthermore, some recent studies suggested that the main difference between humans and chimpanzees is the ability of over-imitation [3]. Our proposal of using a reward profile to solve the correspondence problem in order to transfer a complex behavior from a human to a humanoid is based on these previous neuroscience works and previous experiments performed in a real humanoid standing up from a chair [17]. However, we are not sure of what is the internal objective function that the brain is optimizing Figure 10.…”
Section: Discussion On the Reward Profilementioning
confidence: 94%
“…The generality comes from the definition of the reward profile. In fact, any behavior can be modeled, from simple ones as in [17] to complex behaviors. The preference in the selection of a predefined reward function over a learned function like in inverse reinforcement learning [29,30] does not affect the general idea of comparing the behavior of a human and a robot in a common domain, which is the reward domain.…”
Section: Discussion On the Reward Profilementioning
confidence: 99%
See 3 more Smart Citations