Learning force-based manipulation of deformable objects from multiple demonstrations

Lee, Alex X.; Lu, Henry Horng-Shing; Gupta, Abhishek; Levine, Sergey; Abbeel, Pieter

doi:10.1109/icra.2015.7138997

Cited by 111 publications

(77 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…al. [6] employed machine learning for registering demonstrations to a new situation. More precisely, at first, multiple demonstrations were shown to the robot.…”

Section: Related Workmentioning

confidence: 99%

“…(6). Note that cov(x x x t , y y y t ) and cov(y y y t , x x x t ) can be analytically computed for the given policy.…”

Section: A Probabilistic Inference For Learning Controlmentioning

confidence: 99%

“…On the other hand, machine learning has been employed to achieved DLO manipulation by means of a shown demonstration in advance [6]. This machine learning-based ap- proach avoids the need for a high-dimensionality DLO model by focusing on learning by registering demonstrations into a new situation.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Model-based reinforcement learning approach for deformable linear object manipulation

Han

Paul

Matsubara

2017

2017 13th IEEE Conference on Automation Science and Engineering (CASE)

View full text Add to dashboard Cite

Abstract-Deformable Linear Object (DLO) manipulation has wide application in industry and in daily life. Conventionally, it is difficult for a robot to manipulate a DLO to achieve the target configuration due to the absence of the universal model that specifies the DLO regardless of the material and environment. Since the state variable of a DLO can be very high dimensional, identifying such a model may require a huge number of samples. Thus, model-based planning of DLO manipulation would be impractical and unreasonable. In this paper, we explore another approach based on reinforcement learning. To this end, our approach is to apply a sampleefficient model-based reinforcement learning method, so-called PILCO [1], to resolve the high dimensional planning problem of DLO manipulation with a reasonable number of samples. To investigate the effectiveness of our approach, we developed an experimental setup with a dual-arm industrial robot and multiple sensors. Then, we conducted experiments to show that our approach is efficient by performing a DLO manipulation task.

show abstract

“…al. [6] employed machine learning for registering demonstrations to a new situation. More precisely, at first, multiple demonstrations were shown to the robot.…”

Section: Related Workmentioning

confidence: 99%

“…(6). Note that cov(x x x t , y y y t ) and cov(y y y t , x x x t ) can be analytically computed for the given policy.…”

Section: A Probabilistic Inference For Learning Controlmentioning

confidence: 99%

See 1 more Smart Citation

Model-based reinforcement learning approach for deformable linear object manipulation

Han

Paul

Matsubara

2017

2017 13th IEEE Conference on Automation Science and Engineering (CASE)

View full text Add to dashboard Cite

show abstract

“…In this paper, we study how to improve an imitated wood planing task which exhibits an extremely complex dynamic interaction; every cut will generate a new environment, which does not allow for generalization [8], [9], [10] or planning [11] of the planing task. Hence, we apply the trial and error framework of reinforcement learning (RL) for improving the performance of the imitated planing skill.…”

Section: Reinforcement Learning For Improving Imitated In-contact Skillsmentioning

confidence: 99%

Reinforcement learning for improving imitated in-contact skills

Hazara

Kyrki

2016

2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids)

View full text Add to dashboard Cite

“…In the field of LfD, it has been proposed to derive stiffness variations via kinematic variability in demonstrated data [36]. Recent works have taken force measurements into account for estimating stiffness using weighted least squares [37] and least squares with platform specific priors on the stiffness parameters [38]. Another approach developed dedicated Human-Robot interfaces for the purpose of enabling stiffness variations to be easily taught to a robot [17].…”

Section: Learning Varying Stiffness Controlmentioning

confidence: 99%

Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies

Rey

Kronander

Farshidian³

et al. 2017

Auton Robot

View full text Add to dashboard Cite

An important challenge when using Reinforcement Learning for learning motions in robotics is the choice of parameterization for the policy. We use Gaussian Mixture Regression to extract a parameterization with relevant non-linear features from a set of demonstrations of a motion following the paradigm of Learning from Demonstration. The resulting parameterization takes the form of a non-linear time-invariant dynamical system (DS). We use this time-invariant DS as a parameterized policy for a variant of the PI 2 policy search algorithm. This paper contributes by adapting PI 2 for our time-invariant motion representation. We introduce two novel parameter exploration schemes that can be used to 1) sample model parameters to achieve a uniform exploration in state space and 2) explore while ensuring stability of the resulting motion model. Additionally, a state dependent stiffness profile is learned simultaneously to the reference trajectory and both are used together in a variable impedance control architecture. This learning architecture is validated in a hardware experiment consisting of a digging task using a KUKA LWR platform.

show abstract

Learning force-based manipulation of deformable objects from multiple demonstrations

Cited by 111 publications

References 32 publications

Model-based reinforcement learning approach for deformable linear object manipulation

Model-based reinforcement learning approach for deformable linear object manipulation

Reinforcement learning for improving imitated in-contact skills

Learning motions from demonstrations and rewards with time-invariant dynamical systems based policies

Contact Info

Product

Resources

About