2008
DOI: 10.1007/s11263-007-0118-0
|View full text |Cite
|
Sign up to set email alerts
|

Object Class Recognition and Localization Using Sparse Features with Limited Receptive Fields

Abstract: We

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

4
337
0
1

Year Published

2011
2011
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 321 publications
(342 citation statements)
references
References 32 publications
(60 reference statements)
4
337
0
1
Order By: Relevance
“…We assume that the hierarchical processing in both pathways is organized in a similar fashion and, thus, make use of a generic processing architecture for neural feature extraction as shown in Figure 1. The architecture is a modified variant of the object-recognition model proposed by [10,16,26]. In layer S1, different scale representations of the input image are convolved with 2D Gabor filters of different orientations (form path) and a spatiotemporal correlation detector is used to build a discrete velocity space representation (motion path).…”
Section: Visual Featuresmentioning
confidence: 99%
See 2 more Smart Citations
“…We assume that the hierarchical processing in both pathways is organized in a similar fashion and, thus, make use of a generic processing architecture for neural feature extraction as shown in Figure 1. The architecture is a modified variant of the object-recognition model proposed by [10,16,26]. In layer S1, different scale representations of the input image are convolved with 2D Gabor filters of different orientations (form path) and a spatiotemporal correlation detector is used to build a discrete velocity space representation (motion path).…”
Section: Visual Featuresmentioning
confidence: 99%
“…The layer S2 is created by a simple template matching of patches of C1 activities against a number of prototype patches. These prototypes are randomly selected during the learning stage (for details, see [10]). In the final layer C2, the S2 prototype responses are again pooled over a limited neighborhood and combined into a single feature vector which serves as input to the successive classification stage.…”
Section: Visual Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…These findings in conjunction with experimental studies of the visual cortex justify the use of such filters in the so-called standard model for object recognition (Riesenhuber & Poggio, 1999;Serre et al, 2005;Mutch & Lowe, 2008), whose filters are fixed, in contrast to those of Convolutional Neural Networks (CNNs) (LeCun et al, 1998;Behnke, 2003;Simard et al, 2003), whose weights (filters) are randomly initialized and learned in a supervised way using back-propagation (BP). A DNN, the basic building block of our proposed MCDNN, is a hierarchical deep neural network, alternating convolutional with max-pooling layers (Riesenhuber & Poggio, 1999;Serre et al, 2005;Scherer et al, 2010).…”
Section: Introductionmentioning
confidence: 97%
“…The local contour information that is extracted by orientation-selective cells provides basis for further, more complex visual tasks, such as object recognition [16,14,12,19,17,15]. The performance of various contour operators, which are inspired by the function of simple cells, in contour detection tasks has, however, not been quantified and they have not been compared in that respect.…”
Section: Introductionmentioning
confidence: 99%