2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.01048
|View full text |Cite
|
Sign up to set email alerts
|

Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
75
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 185 publications
(76 citation statements)
references
References 19 publications
1
75
0
Order By: Relevance
“…Zhuang et al [22] have also proposed an attentive pairwise interaction framework, inspired by the human mechanism, to identify contrastive clues by comparing two images. Recently, Ji et al [23] have proposed an attention convolutional binary neural tree that characterizes a coarse-to-fine hierarchical model.…”
Section: Related Work a Fine-grained Image Recognitionmentioning
confidence: 99%
“…Zhuang et al [22] have also proposed an attentive pairwise interaction framework, inspired by the human mechanism, to identify contrastive clues by comparing two images. Recently, Ji et al [23] have proposed an attention convolutional binary neural tree that characterizes a coarse-to-fine hierarchical model.…”
Section: Related Work a Fine-grained Image Recognitionmentioning
confidence: 99%
“…The task of Fine-Grained Visual Categorization (FGVC) is one of the most active research topics very recently [1,2,3]. The goal of FGVC is to distinguish the objects shared similar patterns in a specific scenario from different subordinate categories.…”
Section: Introductionmentioning
confidence: 99%
“…However, we argue two problems: (1) There is a lack of works devoting to FGVC on practical electronic commerce (e-commerce) as far as we known, making it an important yet not well studied issue in this domain. (2) The aforementioned approaches simply ignore the internal semantic region(or part) correlation, which reveals the salient information about the images.…”
Section: Introductionmentioning
confidence: 99%
“…The feature extraction network is an essential stage in image processing. Existing FGVC approaches (11)(12)(13)(14)(15)(16)(17)(18)(19)(20)(21)(22)(23)(24)(25)(26)(27) first localize the vehicle parts in the picture and then extract the discriminative features for classification. Most of the previous localization methods (2,15,16) are supervised algorithms and require a lot of auxiliary data.…”
mentioning
confidence: 99%
“…Meanwhile, images labeled with the appropriate vehicle manufacturer and model require professional knowledge in the auto industry. Also, a weakly supervised method (3,11,12,14) to generate discriminative regions not only maximizes the utilization of the semantic information in these regions but also avoids the problem of excessive reliance on labels. However, these methods usually have complex network structures and are not conducive to optimization.…”
mentioning
confidence: 99%