2016
DOI: 10.1109/lgrs.2015.2513443
|View full text |Cite
|
Sign up to set email alerts
|

Bag-of-Visual-Words Scene Classifier With Local and Global Features for High Spatial Resolution Remote Sensing Imagery

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
158
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 325 publications
(160 citation statements)
references
References 22 publications
1
158
0
1
Order By: Relevance
“…Whereas, SIFT descriptor and HOG feature are local features that are used for the representations of local structure [108] and shape information [109]. To represent an entire scene image, they are generally used as building blocks to construct global image features, such as the well-known bag-of-visual-words (BoVW) models [6,8,9,14,19,29,36,38,39,55,93,101,122,123] and HOG feature-based part models [22,23,27,103]. In addition, a number of improved feature encoding/pooling methods have also been proposed in the past few years, such as Fisher vector coding [10,14,84,86], spatial pyramid matching (SPM) [124], and probabilistic topic model (PTM) [11,40,42,43,92,123], etc.…”
Section: A Handcrafted Feature Based Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Whereas, SIFT descriptor and HOG feature are local features that are used for the representations of local structure [108] and shape information [109]. To represent an entire scene image, they are generally used as building blocks to construct global image features, such as the well-known bag-of-visual-words (BoVW) models [6,8,9,14,19,29,36,38,39,55,93,101,122,123] and HOG feature-based part models [22,23,27,103]. In addition, a number of improved feature encoding/pooling methods have also been proposed in the past few years, such as Fisher vector coding [10,14,84,86], spatial pyramid matching (SPM) [124], and probabilistic topic model (PTM) [11,40,42,43,92,123], etc.…”
Section: A Handcrafted Feature Based Methodsmentioning
confidence: 99%
“…Every individual cue captures only one aspect of the scene, so one single type of feature is always inadequate to represent the content of the entire scene image. Accordingly, a combination of multiple complementary features for scene classification [8,9,11,12,20,30,33,85,88,89,92,125] is considered as a potential strategy to improve the performance. For example, Zhao et al [11] presented a dirichlet derived multiple topic model to combine three types of features at a topic level for scene classification.…”
Section: A Handcrafted Feature Based Methodsmentioning
confidence: 99%
“…Hence, to extract a holistic and discriminative feature representation is the most significant part for scene classification. Traditional approaches are mostly based on the Bag-of-Visual-Words (BoVW) model [78,79], but their potential for improvement was limited by the ability of experts to design the feature extractor and the expressive power encoded.…”
Section: B Interpretation Of Sar Imagesmentioning
confidence: 99%
“…We used the traditional two methods and also reproduced the method which uses BoVW model [39]. Simultaneously, we fine-tune the three famous and well-trained CNN models to make the model adaptable to the new classified data set.…”
Section: Experiments and Resultsmentioning
confidence: 99%