2018
DOI: 10.1007/s11263-018-1125-z
|View full text |Cite
|
Sign up to set email alerts
|

From BoW to CNN: Two Decades of Texture Representation for Texture Classification

Abstract: Texture is a fundamental characteristic of many types of images, and texture representation is one of the essential and challenging problems in computer vision and pattern recognition which has attracted extensive research attention over several decades. Since 2000, texture representations based on Bag of Words (BoW) and on Convolutional Neural Networks (CNNs) have been extensively studied with impressive performance. Given this period of remarkable evolution, this paper aims to present a comprehensive survey … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
216
0
5

Year Published

2019
2019
2021
2021

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 319 publications
(223 citation statements)
references
References 236 publications
(599 reference statements)
2
216
0
5
Order By: Relevance
“…There are basically four components for a conventional biometric system: preprocessing, feature extraction, matching, and decision phase. The feature extraction method affects the performance of the system significantly; there are many feature extraction techniques described in [22]. This paper proposes a multimodal biometric system based on the face and iris, which uses a multi-resolution 2D Log-Gabor filter with spectral regression kernel discriminant analysis (SRKDA) to extract pertinent features from the iris.…”
Section: Introductionmentioning
confidence: 99%
“…There are basically four components for a conventional biometric system: preprocessing, feature extraction, matching, and decision phase. The feature extraction method affects the performance of the system significantly; there are many feature extraction techniques described in [22]. This paper proposes a multimodal biometric system based on the face and iris, which uses a multi-resolution 2D Log-Gabor filter with spectral regression kernel discriminant analysis (SRKDA) to extract pertinent features from the iris.…”
Section: Introductionmentioning
confidence: 99%
“…With VLAD able to capture local shape information, we turn to image texture features as a way to capture differences in surface appearances. The computer vision literature is full of methods for capturing image texture features [16]. The technique of Binarized Statistical Image Features (BSIF) [17] is a relatively recent and robust algorithm for separating distinct textures.…”
Section: Computer Visionmentioning
confidence: 99%
“…e higher accuracy indoor scene classification effect will be achieved by the spatial 3D information contained in the depth image, which is insensitive to light and reflects the position relationship between objects. Features of the original images will be extracted by D-SIFT (Dense SIFT) [32], and similar features will be clustered to form BoW (Bag-of-Words) [33][34][35] by K-means [36,37]. Based on BoW, the initial descriptors set including visual image descriptors and depth image descriptors will be generated with the construction of SPM.…”
Section: Multiple Image Descriptor Generation and Filteringmentioning
confidence: 99%