2007 IEEE Conference on Computer Vision and Pattern Recognition 2007
DOI: 10.1109/cvpr.2007.383229
|View full text |Cite
|
Sign up to set email alerts
|

Multiple Class Segmentation Using A Unified Framework over Mean-Shift Patches

Abstract: Object-based segmentation is a challenging topic. Most of the previous algorithms focused on segmenting a single or a small set of objects. In this paper, the multiple class object-based segmentation is achieved using the appearance and bag of keypoints models integrated over meanshift patches. We also propose a novel affine invariant descriptor to model the spatial relationship of keypoints and apply the Elliptical Fourier Descriptor to describe the global shapes. The algorithm is computationally efficient an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
94
0

Year Published

2008
2008
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 122 publications
(97 citation statements)
references
References 24 publications
2
94
0
Order By: Relevance
“…We split the database in two halves for training and testing. Following the protocol of the other works [26,33,14,36], we ignored the void class during training and evaluation, and horse and mountain classes were also ignored due to insuffiencient amount of representative in the database. The reported results are the pixel-wise accuracy, that is the percentage of pixels correctly classified with respect to the total number of pixels in the segmentation ground truth.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We split the database in two halves for training and testing. Following the protocol of the other works [26,33,14,36], we ignored the void class during training and evaluation, and horse and mountain classes were also ignored due to insuffiencient amount of representative in the database. The reported results are the pixel-wise accuracy, that is the percentage of pixels correctly classified with respect to the total number of pixels in the segmentation ground truth.…”
Section: Methodsmentioning
confidence: 99%
“…Note that the results were not obtained using the same training and test datasets. Gould et al [14] used 40% images for training, 60% for test, Shotton et al [26], Verbeek and Triggs [33] split the dataset in two halves, Yang et al [36] reports the average results over a 5-fold cross validation. Our best method (merging multiple segmentation with mean operator) ranks second with for global pixel-wise accuracy with 76.1 behind Gould (76.5).…”
Section: Impact Of the Relaxation Labellingmentioning
confidence: 99%
“…A number of recent works have reported encouraging results on this dataset: Shotton et al (TextonBoost) [27] introduced novel shape-texture features in a boosted framework, and used a CRF to integrate these with other cues; Verbeek and Triggs [28] explored combining spatial field models (like MRFs) with aspect-based models (like PLSA, LDA); Yang et al [30] combined texture, keypoint spatial co-occurrence and global shape into a meanshift framework to perform multi-class segmentation of images. In order to enable direct comparisons against TextonBoost, we duplicate Shotton et al's experimental methodology [27] and employ a random split of 45% for training, 10% for validation, and 45% for testing, while maintaining a similar distribution of classes.…”
Section: -Class Msrcmentioning
confidence: 99%
“…Semantic segmentation has attracted a lot of attention, but most works have focused on the fully supervised setting, where pixel labels are given for the training images [15,14,12,21,22,26]. The basic approach was formulated in [22], where a conditional random field (CRF) was defined over image pixels with unary potentials learnt by a boosted decision tree classifier over texture-layout filters.…”
Section: Related Workmentioning
confidence: 99%
“…This problem is very challenging because the method has to recover latent pixel labels from just presence labels, before it can generalize from the training set to test images. Recently there has been significant progress in fully supervised semantic segmentation [15,14,12,21,22,26], although the problem is still unsolved. The disadvantage of fully supervised techniques is the need for manually labeling pixels in the training set, which is time consuming and expensive.…”
Section: Introductionmentioning
confidence: 99%