High precision grasp pose detection in dense clutter

Gualtieri, Marcus; Pas, Andreas ten; Saenko, Kate; Platt, Robert

doi:10.1109/iros.2016.7759114

Cited by 275 publications

(258 citation statements)

References 17 publications

Supporting

Mentioning

255

Contrasting

Unclassified

Order By: Relevance

“…Rather than directly operating on randomised images, RCAN [8] is a recent approach that instead translate randomised rendered images into their equivalent non-randomised, canonical versions, producing superior results on a complex sim-to-real grasping task. Rather than operating on RGB images, other works have instead used depth images to cross the domain gap [37], [38]; however, in our tasks, the colour of an object is an important feature when inferring what object the robot needs to interact with, particularly when the geometry of the objects are very similar. In our work, we show that domain randomisation can be leveraged to transfer the ability to infer actions from human demonstrations.…”

Section: Related Workmentioning

confidence: 99%

Learning One-Shot Imitation From Humans Without Humans

Bonardi

James

Davison

2020

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

Humans can naturally learn to execute a new task by seeing it performed by other individuals once, and then reproduce it in a variety of configurations. Endowing robots with this ability of imitating humans from third person is a very immediate and natural way of teaching new tasks. Only recently, through meta-learning, there have been successful attempts to one-shot imitation learning from humans; however, these approaches require a lot of human resources to collect the data in the real world to train the robot. But is there a way to remove the need for real world human demonstrations during training? We show that with Task-Embedded Control Networks, we can infer control polices by embedding human demonstrations that can condition a control policy and achieve one-shot imitation learning. Importantly, we do not use a real human arm to supply demonstrations during training, but instead leverage domain randomisation in an application that has not been seen before: sim-to-real transfer on humans. Upon evaluating our approach on pushing and placing tasks in both simulation and in the real world, we show that in comparison to a system that was trained on real-world data we are able to achieve similar results by utilising only simulation data. Videos can be found here * .

show abstract

Section: Related Workmentioning

confidence: 99%

Learning One-Shot Imitation From Humans Without Humans

Bonardi

James

Davison

2020

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

show abstract

“…To exemplify the ability of our approach to improve grasping performance we use a PR2 robot to perform grasps planned by using Grasp Pose Detection (GPD) [32], which predicts a series of 6-DOF candidate grasp poses given a 3D point cloud for a 2-finger grasp. The reachability of the proposed candidate grasps are checked using MoveIt!…”

Section: E Graspingmentioning

confidence: 99%

Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics

Mees

Tatarchenko

Brox

et al. 2019

2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

View full text Add to dashboard Cite

We present a convolutional neural network for joint 3D shape prediction and viewpoint estimation from a single input image. During training, our network gets the learning signal from a silhouette of an object in the input image -a form of self-supervision. It does not require ground truth data for 3D shapes and the viewpoints. Because it relies on such a weak form of supervision, our approach can easily be applied to real-world data. We demonstrate that our method produces reasonable qualitative and quantitative results on natural images for both shape estimation and viewpoint prediction. Unlike previous approaches, our method does not require multiple views of the same object instance in the dataset, which significantly expands the applicability in practical robotics scenarios. We showcase it by using the hallucinated shapes to improve the performance on the task of grasping real-world objects both in simulation and with a PR2 robot.

show abstract

“…Since we have three types of features and three axes to project, we have nine channels in total. For classifier, we use the LeNet [11] structure which is a common structure for grasp pose classification and ranking [7], [10]. The output of the classifier is the binary label {graspable, not graspable} associated with the confidence scores.…”

Section: Grasp Representation and Classificationmentioning

confidence: 99%

GlassLoc: Plenoptic Grasp Pose Detection in Transparent Clutter

Zhou

Pan

et al. 2019

2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

View full text Add to dashboard Cite

Transparent objects are prevalent across many environments of interest for dexterous robotic manipulation. Such transparent material leads to considerable uncertainty for robot perception and manipulation, and remains an open challenge for robotics. This problem is exacerbated when multiple transparent objects cluster into piles of clutter. In household environments, for example, it is common to encounter piles of glassware in kitchens, dining rooms, and reception areas, which are essentially invisible to modern robots. We present the GlassLoc algorithm for grasp pose detection of transparent objects in transparent clutter using plenoptic sensing. GlassLoc classifies graspable locations in space informed by a Depth Likelihood Volume (DLV) descriptor. We extend the DLV to infer the occupancy of transparent objects over a given space from multiple plenoptic viewpoints. We demonstrate and evaluate the GlassLoc algorithm on a Michigan Progress Fetch mounted with a first generation Lytro. The effectiveness of our algorithm is evaluated through experiments for grasp detection and execution with a variety of transparent glassware in minor clutter.

show abstract

High precision grasp pose detection in dense clutter

Cited by 275 publications

References 17 publications

Learning One-Shot Imitation From Humans Without Humans

Learning One-Shot Imitation From Humans Without Humans

Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics

GlassLoc: Plenoptic Grasp Pose Detection in Transparent Clutter

Contact Info

Product

Resources

About