2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00115
|View full text |Cite
|
Sign up to set email alerts
|

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

Abstract: Original SAE SP-AEN Train Test Test SUN CUB Train Test Test (a) (b) Figure 1: (a) Attribute variance heat maps of the 312 attributes in CUB birds [60] and the 102 attributes in SUN scenes [47] (lighter color indicates lower variance, i.e., lower discriminability) and the t-SNE [35] visualizations of the test images represented by all attributes (left) and only the high-variance ones (right). Some of the low-variance attributes (the lighter part to the left of the cut-off line) discarded at training are still n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
176
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 272 publications
(176 citation statements)
references
References 53 publications
0
176
0
Order By: Relevance
“…We provide a general summary of the methods presented in [2], and encourage the reader to study that paper in order to obtain more details on previous works. The majority of the ZSL and GZSL methods tend to compensate the lack of visual representation of the unseen classes with the learning of a mapping between visual and semantic spaces [16], [17]. For instance, a fairly successful approach is based on a bi-linear compatibility function that associates visual representation and semantic features.…”
Section: Literature Reviewmentioning
confidence: 99%
“…We provide a general summary of the methods presented in [2], and encourage the reader to study that paper in order to obtain more details on previous works. The majority of the ZSL and GZSL methods tend to compensate the lack of visual representation of the unseen classes with the learning of a mapping between visual and semantic spaces [16], [17]. For instance, a fairly successful approach is based on a bi-linear compatibility function that associates visual representation and semantic features.…”
Section: Literature Reviewmentioning
confidence: 99%
“…and compute M 0 and M c by equations (11) and (12). 6: Compute β by solving equation (16) and obtain f via the representer theorem in equation (8). 7: Update the so labels of D t :ˆ t = f (Z t ).…”
Section: Algorithm 1 Manifold Embedded Distribution Alignmentmentioning
confidence: 99%
“…e rapid growth of online media and content sharing applications has stimulated a great demand for automatic recognition and analysis for images and other multimedia data [8,20]. Unfortunately, it is o en expensive and time-consuming to acquire su cient labeled data to train machine learning models.…”
Section: Introductionmentioning
confidence: 99%
“…According to [7], aPY [10] has a much smaller cosine similarity (0.58) between the attribute variances of the disjoint train and test images than the other datasets (0.98 for SUN, 0.95 for CUB, 0.74 for AwA2), which means it is harder to synthesize and classify images of unseen classes. Although previous methods have relatively low accuracy for unseen classes, our performance gain is even higher with such a difficult dataset.…”
Section: Resultsmentioning
confidence: 99%