Tactile Guidance for Policy Adaptation

Argall, Brenna; Sauser, Eric L.

doi:10.1561/2300000012

Cited by 17 publications

(11 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For example, a human teacher might supervise the learning process, by modifying targets learned from demonstration [9] or resolving ambiguities in goal representations [10]. Datasets are iteratively built by providing new demonstrations in areas of low policy prediction confidence [40], [41], by providing explicit corrections on policy predictions to generate new data [40], [12] and by physically touching a robot during execution to provide kinesthetic corrections [11], [42], [13].…”

Section: Robot Learningmentioning

confidence: 99%

See 1 more Smart Citation

Iterative learning of grasp adaptation through human corrections

Sauser

Argall

Metta

et al. 2012

Robotics and Autonomous Systems

Self Cite

View full text Add to dashboard Cite

Abstract-In the context of object interaction and manipulation, one characteristic of a robust grasp is its ability to comply with external perturbations applied to the grasped object while still maintaining the grasp. In this work we introduce an approach for grasp adaptation which learns a statistical model to adapt hand posture solely based on the perceived contact between the object and fingers. Using a multi-step learning procedure, the model dataset is built by first demonstrating an initial hand posture, which is then physically corrected by a human teacher pressing on the fingertips, exploiting compliance in the robot hand. The learner then replays the resulting sequence of hand postures, to generate a dataset of posture-contact pairs that are not influenced by the touch of the teacher. A key feature of this work is that the learned model may be further refined by repeating the correction-replay steps. Alternatively, the model may be reused in the development of new models, characterized by the contact signatures of a different object. Our approach is empirically validated on the iCub robot. We demonstrate grasp adaptation in response to changes in contact, and show successful model reuse and improved adaptation with additional rounds of model refinement.

show abstract

Section: Robot Learningmentioning

confidence: 99%

“…Furthermore, our executions do not depend on time (unlike [11], [42], [13]), as our goal is not to execute a trajectory but rather to respond online to changes in contact with an object.…”

Section: Robot Learningmentioning

confidence: 99%

Iterative learning of grasp adaptation through human corrections

Sauser

Argall

Metta

et al. 2012

Robotics and Autonomous Systems

Self Cite

View full text Add to dashboard Cite

show abstract

“…The Tactile Policy Correction (TPC) algorithm offers an approach for the adaptation of a demonstrated policy, using tactile feedback from a human teacher [3]. Corrections are provided in order to accomplish two goals (Fig.…”

Section: Algorithm Overviewmentioning

confidence: 99%

“…For our initial empirical validations of the TPC algorithm [3], the tactile correction interface consisted of Ergonomic Touchpads encircling the wrist of a manipulator arm, with validation on grasp positioning tasks. Comparisons to policies derived from solely teleoperation demonstration confirmed policy reuse to be an effective mechanism for transferring domain knowledge, and policy refinement to be more successful at improving performance.…”

Section: Which First Encodes Demonstrations In a Gaussian Mixture Modmentioning

confidence: 99%

Learning from Demonstration and Correction via Multiple Modalities for a Humanoid Robot

Argall

Billard

2011

BIO Web of Conferences

View full text Add to dashboard Cite

show abstract

“…Tactile feedback is used to assist in both policy refinement and the reuse of a demonstrated policy when developing a different policy; effectively using the demonstrated policy as prior knowledge for a new behavior. Empirical validation has included grasp positioning on the iCub humanoid [4], as well as grasp adaptation in response to changes in fingertip contact [13].…”

Section: A High-dof Humanoidmentioning

confidence: 99%