2013
DOI: 10.1007/978-3-319-00065-7_27
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Feature Learning for RGB-D Based Object Recognition

Abstract: Abstract. Recently introduced RGB-D cameras are capable of providing high quality synchronized videos of both color and depth. With its advanced sensing capabilities, this technology represents an opportunity to dramatically increase the capabilities of object recognition. It also raises the problem of developing expressive features for the color and depth channels of these sensors. In this paper we introduce hierarchical matching pursuit (HMP) for RGB-D data. HMP uses sparse coding to learn hierarchical featu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
208
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 276 publications
(208 citation statements)
references
References 35 publications
0
208
0
Order By: Relevance
“…In the committee above if we only keep mirroring as dataset augmentation, we get 67.39%. Unsupervised feature learning by augmenting single images [6] 67.4 ± 0.6 Efficient Discriminative Convolution Using Fisher Weight Map [12] 66.0 ± 0.7 Unsupervised Feature Learning for RGB-D Based Object Recognition [1] 64.5 ± 1 Discriminative Learning of Sum-Product Networks [8] 62.3 ± 1 Selecting Receptive Fields in Deep Networks [3] 60.1 ± 1 This paper 68.0 ± 0.55…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In the committee above if we only keep mirroring as dataset augmentation, we get 67.39%. Unsupervised feature learning by augmenting single images [6] 67.4 ± 0.6 Efficient Discriminative Convolution Using Fisher Weight Map [12] 66.0 ± 0.7 Unsupervised Feature Learning for RGB-D Based Object Recognition [1] 64.5 ± 1 Discriminative Learning of Sum-Product Networks [8] 62.3 ± 1 Selecting Receptive Fields in Deep Networks [3] 60.1 ± 1 This paper 68.0 ± 0.55…”
Section: Resultsmentioning
confidence: 99%
“…Thus, we disregard the original meaning of the classifier output and scale the set of values to [0, 1] in order to obtain some scores that will be later combined. Now, for an input test image, we have N sets of scores in [0,1]. Each set of scores corresponds to the output of one network.…”
Section: Building the Committeementioning
confidence: 99%
“…Later on, due to its capability of providing accurate depth information with relatively low cost, the usage of Kinect goes beyond gaming, and is extended to the computer vision field. This device equipped with intelligent algorithms is contributing to various applications, such as 3D-simultaneous localization and mapping (SLAM) [39,54], people tracking [69], object recognition [11] and human activity analysis [13,57], etc. In this section, we introduce Kinect from two perspectives: hardware configuration and software tools.…”
Section: A Brief Review Of Kinectmentioning
confidence: 99%
“…Feature learning has been applied successfully in computer vision; the technique has achieved state-of-the-art performance in object detection [25], image classification [26] and object recognition [27] tasks. We demonstrate the utility and generality of such a system by using it for automated classification of three invasive weed species on the north-west slopes of New South Wales, Australia.…”
Section: Introductionmentioning
confidence: 99%