2021
DOI: 10.3390/s21093241
|View full text |Cite
|
Sign up to set email alerts
|

Zero-Shot Image Classification Based on a Learnable Deep Metric

Abstract: The supervised model based on deep learning has made great achievements in the field of image classification after training with a large number of labeled samples. However, there are many categories without or only with a few labeled training samples in practice, and some categories even have no training samples at all. The proposed zero-shot learning greatly reduces the dependence on labeled training samples for image classification models. Nevertheless, there are limitations in learning the similarity of vis… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 33 publications
0
3
0
Order By: Relevance
“…Both types of information are projected onto the MKE space to infer similarities between seen and unseen classes. A zero-shot image classification method based on a learnable deep metric (ZIC-LDM) [35] was proposed to learn a common space so that image and semantic features can be mapped onto this space to help with the semantic gap problem. In several papers [36][37][38][39][40][41], auto-encoders were used to map visual and semantic information onto the latent space.…”
Section: Latent Space Embeddingmentioning
confidence: 99%
See 1 more Smart Citation
“…Both types of information are projected onto the MKE space to infer similarities between seen and unseen classes. A zero-shot image classification method based on a learnable deep metric (ZIC-LDM) [35] was proposed to learn a common space so that image and semantic features can be mapped onto this space to help with the semantic gap problem. In several papers [36][37][38][39][40][41], auto-encoders were used to map visual and semantic information onto the latent space.…”
Section: Latent Space Embeddingmentioning
confidence: 99%
“…The present study focused on semantic-based few-shot image recognition, so the following recommendations for researchers are mainly on this topic. The most critical issue for semantic-based FSL is the domain shift problem [26,35]. Visual and semantic features have different properties and cannot be combined directly.…”
Section: Recommendationsmentioning
confidence: 99%
“…The goal of this module is to calculate the degree of similarity between samples and to generate a correlation result [28]. Dense, Activation Relu, and Dropout layers were used in this module.…”
Section: ) the Relation Modulementioning
confidence: 99%