2023
DOI: 10.1016/j.displa.2023.102468
|View full text |Cite
|
Sign up to set email alerts
|

A collaborative gated attention network for fine-grained visual classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 40 publications
0
3
0
Order By: Relevance
“…According to the experimental data in Table 4, on the FGVC-Aircraft dataset, our PGPL improves the classification accuracy by 0.4% and 1.1% compared with the collaborative gated attention network (CG) 1 and progressive multi-granularity (PMG) models, respectively. The CG model emphasizes the interrelationship among cross-layer features and uses channel and spatial attention modules to locate key part features to optimize model performance.…”
Section: Evaluation and Analysis On The Fgvc-aircraft Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…According to the experimental data in Table 4, on the FGVC-Aircraft dataset, our PGPL improves the classification accuracy by 0.4% and 1.1% compared with the collaborative gated attention network (CG) 1 and progressive multi-granularity (PMG) models, respectively. The CG model emphasizes the interrelationship among cross-layer features and uses channel and spatial attention modules to locate key part features to optimize model performance.…”
Section: Evaluation and Analysis On The Fgvc-aircraft Datasetmentioning
confidence: 99%
“…Fine-grained image classification (FGIC) aims to distinguish specific subcategories from the same superclass. 1 It plays a key role in many science and engineering fields, such as environmental protection, 2 intelligent transportation, 3 and medical image diagnosis. 4 Compared with general image classification, FGIC is more challenging due to the minor inter-class differences and the large intra-class variations.…”
Section: Introductionmentioning
confidence: 99%
“…The peak response region feature of the feature map is used to cluster the channels with similar response regions to obtain local regions with discrimination. At the same time, the channel grouping loss function is used to increase the inter-class differentiation and reduce the intra-class differentiation; Zhang et al [19] control the contribution of different regions to recognition through the gating mechanism; Zhu et al [20] proposed a simple and effective cross door attention learning strategy that guides the final classification through rich discriminative features in key regions, and achieved good results.…”
Section: Related Workmentioning
confidence: 99%