2017
DOI: 10.1109/tip.2017.2675339
|View full text |Cite
|
Sign up to set email alerts
|

Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs

Abstract: Convolutional neural networks (CNNs) have made remarkable progress on scene recognition, partially due to these recent large-scale scene datasets, such as the Places and Places2. Scene categories are often defined by multi-level information, including local objects, global layout, and background environment, thus leading to large intra-class variations. In addition, with the increasing number of scene categories, label ambiguity has become another crucial issue in large-scale classification. This paper focuses… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
87
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 136 publications
(87 citation statements)
references
References 51 publications
0
87
0
Order By: Relevance
“…When a mixed CCM-CCG model is used, our FOSNet achieves state-of-the-art accuracy of 60.14% on the Places 2, and it is the first time that the accuracy exceeds 60% on the dataset. [1] 56.2 Gaze Shifting-CNN+SVM [19] 56.2 MetaObject-CNN [15] 58.11 Places365-VGG-SVM [28] 63.24 Three [5] 70.17 Hybrid CNN [21] 70.69 Sparse Representation [23] 71.08 Multi-Resolution CNNs [7] 72.0 CNN-SMN [14] 72.6 PatchNet [22] 73.0 SDO [6] 73.41 Adi-Red [13] 73.59 SOSF+CFA+GAF [20] 78…”
Section: Experimental Results On the Placesmentioning
confidence: 99%
See 4 more Smart Citations
“…When a mixed CCM-CCG model is used, our FOSNet achieves state-of-the-art accuracy of 60.14% on the Places 2, and it is the first time that the accuracy exceeds 60% on the dataset. [1] 56.2 Gaze Shifting-CNN+SVM [19] 56.2 MetaObject-CNN [15] 58.11 Places365-VGG-SVM [28] 63.24 Three [5] 70.17 Hybrid CNN [21] 70.69 Sparse Representation [23] 71.08 Multi-Resolution CNNs [7] 72.0 CNN-SMN [14] 72.6 PatchNet [22] 73.0 SDO [6] 73.41 Adi-Red [13] 73.59 SOSF+CFA+GAF [20] 78…”
Section: Experimental Results On the Placesmentioning
confidence: 99%
“…In addition, the traits that features appearing in each image region within scene images are all similar was used in [18]. A super category was proposed to solve the problem that the scene categories have label ambiguity in [7]. A deep gaze shifting kernel was developed to distinguish sceneries from different categories in [19].…”
Section: A Traits In Scene Imagementioning
confidence: 99%
See 3 more Smart Citations