Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition

Wang, Yaming; Morariu, Vlad I.; Davis, Larry S.

doi:10.1109/cvpr.2018.00436

Cited by 392 publications

(238 citation statements)

References 49 publications

Supporting

Mentioning

236

Contrasting

Unclassified

Order By: Relevance

“…From the results presented in Tables I and II, we can observe that a sub-component level segmentation strategy, supported by the secondary fine-grain CNN classification of DFL model [22], offers significantly superior anomaly detection performance (A: 97.91, TP: 98.20, FP: 3.50 - Table I) than an object level segmentation strategy overall (Table II). Furthermore, fine-grain CNN classification similarly offers the highest overall accuracy and lowest false positive rate (A: 89.77, FP: 3.88 - Table II) for object level segmentation.…”

Section: Discussionmentioning

confidence: 93%

“…Second stage binary classification via CNN performed less well overall with the sub-component segmentation strategy (lower accuracy (A) caused by significantly higher false positive (FP) - Table II). Fine grain classification model (DFL [22]) offer the lowest false positive and maximal accuracy for both segmentation strategies (Table I). We can deduce that increased levels isolation via segmentation to the sub-component level improves the performance of the discriminative feature space learnt by the fine-grain technique [20]- [22] whilst more classical object classification CNN architectures perform only marginally better on objects than sub-components (Table II).…”

Section: Discussionmentioning

confidence: 99%

“…Fine grain classification model (DFL [22]) offer the lowest false positive and maximal accuracy for both segmentation strategies (Table I). We can deduce that increased levels isolation via segmentation to the sub-component level improves the performance of the discriminative feature space learnt by the fine-grain technique [20]- [22] whilst more classical object classification CNN architectures perform only marginally better on objects than sub-components (Table II). Figure 4 illustrates the attention (red/pink patch with the highest focus) of the fine-grain DFL model [22] whilst trained on object-level and sub-component level segmentation data respectively.…”

Section: Discussionmentioning

confidence: 99%

See 2 more Smart Citations

On the Impact of Object and Sub-Component Level Segmentation Strategies for Supervised Anomaly Detection within X-Ray Security Imagery

Bhowmik

Gaus

Akçay

et al. 2019

2019 18th IEEE International Conference on Machine Learning and Applications (ICMLA)

View full text Add to dashboard Cite

X-ray security screening is in widespread use to maintain transportation security against a wide range of potential threat profiles. Of particular interest is the recent focus on the use of automated screening approaches, including the potential anomaly detection as a methodology for concealment detection within complex electronic items. Here we address this problem considering varying segmentation strategies to enable the use of both object level and sub-component level anomaly detection via the use of secondary convolutional neural network (CNN) architectures. Relative performance is evaluated over an extensive dataset of exemplar cluttered X-ray imagery, with a focus on consumer electronics items. We find that sub-component level segmentation produces marginally superior performance in the secondary anomaly detection via classification stage, with true positive of ∼ 98% of anomalies, with a ∼ 3% false positive.

show abstract

Section: Discussionmentioning

confidence: 93%

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

On the Impact of Object and Sub-Component Level Segmentation Strategies for Supervised Anomaly Detection within X-Ray Security Imagery

Bhowmik

Gaus

Akçay

et al. 2019

2019 18th IEEE International Conference on Machine Learning and Applications (ICMLA)

View full text Add to dashboard Cite

show abstract

“…Recently, some frameworks employ a more general architecture that can localize discriminative parts within an image without any extra supervision from part annotations, and thus it makes the fine-grained image classification more feasible in real-world scenarios. Wang et al [40] claimed that improving mid-level convolutional feature representation can bring significant advantages for part-based fine-grained classification. This is accomplished by introducing a bank of discriminative filters in the classical convolutional neural networks (CNNs) architecture and it can be trained in an endto-end fashion.…”

Section: A Fined-grained Image Classificationmentioning

confidence: 99%

“…We can observe that feature channels after going through the MC-Loss become class-aligned and each focues on different discriminative regions that roughly correspond to object parts. subtle differences is the key for solving fine-grained image classification [40], [46], [47].…”

Section: Introductionmentioning

confidence: 99%

The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification

Chang

Ding

Xie

et al. 2020

IEEE Trans. on Image Process.

316

102

View full text Add to dashboard Cite

Key for solving fine-grained image categorization is finding discriminate and local regions that correspond to subtle visual traits. Great strides have been made, with complex networks designed specifically to learn part-level discriminate feature representations. In this paper, we show it is possible to cultivate subtle details without the need for overly complicated network designs or training mechanisms -a single loss is all it takes. The main trick lies with how we delve into individual feature channels early on, as opposed to the convention of starting from a consolidated feature map. The proposed loss function, termed as mutual-channel loss (MC-Loss), consists of two channel-specific components: a discriminality component and a diversity component. The discriminality component forces all feature channels belonging to the same class to be discriminative, through a novel channel-wise attention mechanism. The diversity component additionally constraints channels so that they become mutually exclusive on spatial-wise. The end result is therefore a set of feature channels that each reflects different locally discriminative regions for a specific class. The MC-Loss can be trained end-to-end, without the need for any bounding-box/part annotations, and yields highly discriminative regions during inference. Experimental results show our MC-Loss when implemented on top of common base networks can achieve state-of-the-art performance on all four fine-grained categorization datasets (CUB-Birds, FGVC-Aircraft, Flowers-102, and Stanford-Cars). Ablative studies further demonstrate the superiority of MC-Loss when compared with other recently proposed general-purpose losses for visual classification, on two different base networks. Code available at https://github.com/ dongliangchang/Mutual-Channel-Loss

show abstract