2014
DOI: 10.1007/978-3-319-10599-4_41
|View full text |Cite
|
Sign up to set email alerts
|

Sliding Shapes for 3D Object Detection in Depth Images

Abstract: Abstract. The depth information of RGB-D sensors has greatly simplified some common challenges in computer vision and enabled breakthroughs for several tasks. In this paper, we propose to use depth maps for object detection and design a 3D detector to overcome the major difficulties for recognition, namely the variations of texture, illumination, shape, viewpoint, clutter, occlusion, selfocclusion and sensor noises. We take a collection of 3D CAD models and render each CAD model from hundreds of viewpoints to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
294
0
1

Year Published

2015
2015
2018
2018

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 332 publications
(295 citation statements)
references
References 81 publications
0
294
0
1
Order By: Relevance
“…As an empirical observation, we estimate that the gap size should be roughly equal to half the average cluster size for an optimal reconstruction. Note that these cluster sizes are much larger than the ones usually found in the literature for similar tasks, such as object detection and scene reconstruction [34,20,30].…”
Section: Resultsmentioning
confidence: 73%
“…As an empirical observation, we estimate that the gap size should be roughly equal to half the average cluster size for an optimal reconstruction. Note that these cluster sizes are much larger than the ones usually found in the literature for similar tasks, such as object detection and scene reconstruction [34,20,30].…”
Section: Resultsmentioning
confidence: 73%
“…Barron et al [6] leverage the depth for intrinsic image decomposition of natural scenes. Song and Xiao [34] show that depth can avoid many difficulties for object detection from RGB images, such as variations in illumination, shape, and viewpoint. In particular, they adopt the Truncated Signed Distance Function (TSDF) as a cue for self-occlusion.…”
Section: Previous Workmentioning
confidence: 99%
“…As consumer depth sensors become widely adopted, e.g. Microsoft Kinect, the depth information has brought breakthroughs to several vision tasks, such as pose estimation [31], intrinsic image decomposition [6], and object detection [34]. The idea of leveraging depth cues to allow monocular scene flow estimation has also received increasing attention.…”
Section: Introductionmentioning
confidence: 99%
“…For example, [18,17] use objects from Google's 3D Warehouse to train an object detection system for 3D point clouds collected by robots navigating through urban and indoor environments. [28] uses 3D CAD models as templates for a sliding window search. Shen et al [25] use a database of segmented models and assembly-based modeling algorithms to reconstruct novel 3D models.…”
Section: Related Workmentioning
confidence: 99%