Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Dwibedi, Debidatta; Misra, Ishan; Hebert, Martial

doi:10.1109/iccv.2017.146

Cited by 589 publications

(551 citation statements)

References 62 publications

Supporting

Mentioning

540

Contrasting

Unclassified


“…A similar problem is also pointed out in previous work [10]: they had objects for which it is difficult to extract foreground mask by thresholding of the depth image, in particular with transparent objects such as Cola Bottle. To overcome this difficulty, they trained a small convolutional neural network (ConvNet) model [21] using the mask acquired from thresholding of the depth image as the ground truth.…”

Section: Image Synthesis For Learning Instance Occlusion Segmentat (supporting)

confidence: 57%
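The depth-thresholding step mentioned in the statement above can be illustrated with a minimal sketch. It is not the cited authors' code: the function name `depth_foreground_mask`, the cutoff in metres, and the convention that a reading of 0 means "no valid depth" are all assumptions made for illustration. It also hints at why transparent objects are hard: if the sensor returns no valid depth on them, their pixels fall out of the mask.

```python
def depth_foreground_mask(depth, max_depth_m, invalid=0.0):
    """Return a binary mask from a depth image given as nested lists.

    A pixel counts as foreground when it has a valid reading that is
    closer than max_depth_m. Both the direction of the threshold and
    the `invalid` marker value are assumptions, not taken from the paper.
    """
    return [
        [1 if (d != invalid and d < max_depth_m) else 0 for d in row]
        for row in depth
    ]


# Hypothetical 2x2 depth image in metres; 0.0 marks a missing reading,
# as might happen on a transparent surface.
example_depth = [[0.4, 0.9],
                 [0.0, 0.5]]
example_mask = depth_foreground_mask(example_depth, max_depth_m=0.6)
```

A mask obtained this way is what the quoted statement describes using as ground truth for training a small ConvNet.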

“…Above work focus on developing ways to make synthetic images closer to real images. On the other hand, it has recently been found that synthesizing only 2D instance images of objects is also effective to train detection models of object bounding boxes [9,10]. The base idea for this is that if we could generate infinite synthetic images at random and train learning model with it, the model would generalize to real images.…”

Section: B Image Synthesis For Object Detection (mentioning)

confidence: 99%

“…Blending is necessary for 2D image synthesis to remove boundary artifacts when we put the instance images onto the background image. We apply gaussian blurring following [10] with a random sigma which ranges from 0 to 1.…”

Section: B Blending (mentioning)

confidence: 99%
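The blending step quoted above can be sketched as follows. This is one plausible reading, not the cited implementation: it assumes the Gaussian blur is applied to the binary instance mask to produce a soft alpha matte, which is then used to composite the instance over the background. Images are grayscale nested lists for simplicity, and all function names are hypothetical. Only the sigma range of 0 to 1 comes from the statement.

```python
import math
import random


def gaussian_kernel(sigma):
    """Normalized 1D Gaussian kernel; a near-zero sigma means no blur."""
    if sigma < 1e-6:
        return [1.0]
    radius = max(1, math.ceil(3 * sigma))
    weights = [math.exp(-(i * i) / (2 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur_rows(img, kernel):
    """Convolve each row with the kernel, replicating edge pixels."""
    radius = len(kernel) // 2
    out = []
    for row in img:
        n = len(row)
        new_row = []
        for x in range(n):
            acc = 0.0
            for k, w in enumerate(kernel):
                xx = min(max(x + k - radius, 0), n - 1)  # clamp at borders
                acc += w * row[xx]
            new_row.append(acc)
        out.append(new_row)
    return out


def transpose(img):
    return [list(col) for col in zip(*img)]


def gaussian_blur(img, sigma):
    """Separable 2D Gaussian blur: rows first, then columns."""
    kernel = gaussian_kernel(sigma)
    horizontal = blur_rows(img, kernel)
    return transpose(blur_rows(transpose(horizontal), kernel))


def blend(fg, bg, mask, sigma):
    """Composite fg over bg using the blurred mask as per-pixel alpha."""
    alpha = gaussian_blur(mask, sigma)
    return [
        [a * f + (1.0 - a) * b for f, b, a in zip(f_row, b_row, a_row)]
        for f_row, b_row, a_row in zip(fg, bg, alpha)
    ]


# Hypothetical 6x6 example: a bright 2x2 instance on a dark background.
size = 6
mask = [[1.0 if 2 <= y <= 3 and 2 <= x <= 3 else 0.0 for x in range(size)]
        for y in range(size)]
fg = [[1.0] * size for _ in range(size)]
bg = [[0.0] * size for _ in range(size)]
sigma = random.uniform(0.0, 1.0)  # random sigma in [0, 1], as in the statement
composite = blend(fg, bg, mask, sigma)
```

With sigma near 0 the result is a hard paste; larger values soften the instance boundary, which is the artifact the blending is meant to hide.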

“…Data augmentation is crucial for generating synthetic data to train detection models that will generalize to real images, especially when we only have a few instance images (4 − 6), compared to 600 images found in [10]. We applied color data augmentation in addition to the geometric augmentation present in past work [10]. To make the system more robust to changes in brightness and light reflection, we also applied multiplication to S and V channels after converting the RGB image into HSV color space with random selection of scale between 0.5 to 2.0.…”

Section: Data Augmentation (mentioning)

confidence: 99%
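The color augmentation described in the statement above (scaling the S and V channels in HSV space by a random factor between 0.5 and 2.0) can be sketched with the standard-library `colorsys` module. The function names are hypothetical, and two details are assumptions the statement does not settle: S and V are given independent random scales here, and scaled values are clipped to 1.0.

```python
import colorsys
import random


def scale_sv(pixels, s_scale, v_scale):
    """Scale saturation and value of RGB pixels (floats in [0, 1]).

    Each pixel is converted RGB -> HSV, S and V are multiplied and
    clipped to 1.0 (clipping is an assumption), then converted back.
    """
    out = []
    for r, g, b in pixels:
        h, s, v = colorsys.rgb_to_hsv(r, g, b)
        s = min(1.0, s * s_scale)
        v = min(1.0, v * v_scale)
        out.append(colorsys.hsv_to_rgb(h, s, v))
    return out


def random_sv_augment(pixels, low=0.5, high=2.0, rng=random):
    """Draw independent S and V scales from [low, high] and apply them."""
    s_scale = rng.uniform(low, high)
    v_scale = rng.uniform(low, high)
    return scale_sv(pixels, s_scale, v_scale)


# Hypothetical usage on a flat list of RGB pixels.
instance_pixels = [(0.5, 0.25, 0.25), (0.1, 0.6, 0.3), (0.0, 0.0, 0.0)]
augmented = random_sv_augment(instance_pixels)
```

Scaling V changes apparent brightness and scaling S changes color intensity, which matches the stated goal of robustness to brightness and light reflection.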

“…• Image synthesis framework for learning instance segmentation including occlusion, which is a straightforward extension to recent works [9,10]; • A novel instance segmentation network model that uses the instance density to segment multi-class masks by extending recent works [7,8]; • A new metric for instance segmentation of multi-class masks extending recent work of instance segmentation of a single mask [11]; • The integrated system of the above components and demonstration in the picking task from a pile of objects.…”

Section: Introduction (mentioning)

confidence: 99%


Instance Segmentation of Visible and Occluded Regions for Finding and Picking Target from a Pile of Objects

Wada, Kitagawa, Okada, et al. 2018

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)


We present a robotic system for picking a target from a pile of objects that is capable of finding and grasping the target object by removing obstacles in the appropriate order. The fundamental idea is to segment instances with both visible and occluded masks, which we call 'instance occlusion segmentation'. To achieve this, we extend an existing instance segmentation model with a novel 'relook' architecture, in which the model explicitly learns the inter-instance relationship. Also, by using image synthesis, we make the system capable of handling new objects without human annotations. The experimental results show the effectiveness of the relook architecture when compared with a conventional model and of the image synthesis when compared to a human-annotated dataset. We also demonstrate the capability of our system to achieve picking a target in a cluttered environment with a real robot.

• Instance occlusion segmentation neural networks trained using the generated images (§IV);
