2015 IEEE Winter Conference on Applications of Computer Vision 2015
DOI: 10.1109/wacv.2015.147
|View full text |Cite
|
Sign up to set email alerts
|

Semantic Instance Labeling Leveraging Hierarchical Segmentation

Abstract: Most of the approaches for indoor RGBD semantic labeling focus on using pixels or superpixels to train a classifier. In this paper, we implement a higher level segmentation using a hierarchy of superpixels to obtain a better segmentation for training our classifier. By focusing on meaningful segments that conform more directly to objects, regardless of size, we train a random forest of decision trees as a classifier using simple features such as the 3D size, LAB color histogram, width, height, and shape as spe… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 25 publications
0
7
0
Order By: Relevance
“…Notice that the authors of [26] achieve very high performances by exploiting a much more complex deep learning architecture. In any case, our method is the one that gets closer to it, while even the very recent methods of [39, 25] have lower performances than ours.…”
Section: Resultsmentioning
confidence: 73%
See 3 more Smart Citations
“…Notice that the authors of [26] achieve very high performances by exploiting a much more complex deep learning architecture. In any case, our method is the one that gets closer to it, while even the very recent methods of [39, 25] have lower performances than ours.…”
Section: Resultsmentioning
confidence: 73%
“…In order to evaluate the accuracy of this labelling, we compared it with some competing approaches on the NYUD2 test set. The compared state-of-the-art approaches are the methods of [23] that uses a multi-scale CNN, of [39] that uses a hierarchy of super-pixels to train a random forest classifier, of [27] that uses deep learning to extract super-pixels features, of [25] exploiting two different CNNs, of [20] using random forest and CRFs and finally [26] using a multi-scale deep learning architecture. Table 2 reports the results: two different metrics have been considered, the per-pixel accuracy, counting the percentage of correctly classified pixels and the average class accuracy, obtained by computing the percentage of correctly classified pixels for each class independently and averaging the values.…”
Section: Evaluation Of the Classification Accuracymentioning
confidence: 99%
See 2 more Smart Citations
“…For example, Lin et al [18] proposed a 3D scene recovery system, in which objects in different categories are reconstructed with domain-specific knowledge based on the results of semantic segmentation. Unlike the RGB-D data used by the previous works [3,7,8,19], outdoor LIDAR data for semantic segmentation lacks the color and texture information. Meanwhile, accurate semantic segmentation of 3D point clouds is a challenging task due to occlusions caused by obstructions, varying point density caused by different distances of objects from the sensor and complex object structures appearing in 3D point clouds.…”
Section: Introductionmentioning
confidence: 99%