2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017
DOI: 10.1109/iros.2017.8202206
|View full text |Cite
|
Sign up to set email alerts
|

A self-supervised learning system for object detection using physics simulation and multi-view pose estimation

Abstract: Progress has been achieved recently in object detection given advancements in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limits their applicability in robotics, where solutions must scale to a large number of objects and variety of conditions. This work proposes an autonomous process for training a Convolutional Neural Network (CNN) for object detection and pose estimation in robotic setups. The focus is on dete… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
63
0
1

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 98 publications
(64 citation statements)
references
References 38 publications
0
63
0
1
Order By: Relevance
“…Other lines of work utilize photo-realistic rendering and realistic scene compositions to overcome the domain gap by synthesizing images that match the real world as close as possible [9,13,25,17,1,8,33,18]. While these methods have shown promising results they face many hard challenges.…”
Section: Related Workmentioning
confidence: 99%
“…Other lines of work utilize photo-realistic rendering and realistic scene compositions to overcome the domain gap by synthesizing images that match the real world as close as possible [9,13,25,17,1,8,33,18]. While these methods have shown promising results they face many hard challenges.…”
Section: Related Workmentioning
confidence: 99%
“…Semi-supervised learning (Blum and Mitchell, 1998;Joachims, 1999) addresses this problem by making use of a large amount of unlabeled data and a small amount of labeled data. Similarly, as an autonomous supervised learning approach, self-supervised learning (Mitash et al, 2017) usually extracts and uses the naturally available relevant context and embedded meta data as supervisory signals. Active learning (Arasu et al, 2010;Bellare et al, 2012) is another special case of supervised learning in which a learning algorithm is able to interactively query the user (or some other information source) to obtain the desired outputs at new data points.…”
Section: Machine Learning Paradigmsmentioning
confidence: 99%
“…Eventhough this is promising as data driven approaches could help close the gap between 3d geometric models and noisy observed data, it needs access to a large model aligned 3d training dataset, which may be difficult to collect. Another technique often used is to perform object segmentation using CNNs trained specifically for the setup [11], [2], [19] and perform point cloud registration methods The image describes the process of hypotheses generation for objects present in the scene. The process starts with extracting object segments S 1:3 using Faster-RCNN [6], followed by using a global point cloud registration technique [4] to compute a set of possible model transformations (T 1:3 ) that corresponds to the respective segments.…”
Section: B Progress In Deep Learningmentioning
confidence: 99%
“…In order to address the issue of potentially conflicting candidate object poses, the scene hypotheses are dynamically constructed by introducing a constrained local optimization step over candidate object poses returned by Super4PCS, a fast global model matching method [4]. To limit detection errors that arise in cluttered scene, the proposed method builds on top of a previous contribution [11], which performs clutter-specific autonomous training to get object segments. This paper provides experimental indications that the set of candidate object poses returned by Super4PCS given the clutter-aware training contains object poses that are close enough to the ground truth, however, these might not be the ones that receive the best matching score according to Super4PCS.…”
Section: Introductionmentioning
confidence: 99%