2013 IEEE International Conference on Computer Vision 2013
DOI: 10.1109/iccv.2013.321
|View full text |Cite
|
Sign up to set email alerts
|

Write a Classifier: Zero-Shot Learning Using Purely Textual Descriptions

Abstract: The main question we address in this paper is how to use purely textual description of categories with no training images to learn visual classifiers for these categories. We propose an approach for zero-shot learning of object categories where the description of unseen categories comes in the form of typical text such as an encyclopedia entry, without the need to explicitly defined attributes. We propose and investigate two baseline formulations, based on regression and domain adaptation. Then, we propose a n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
259
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 243 publications
(259 citation statements)
references
References 33 publications
(52 reference statements)
0
259
0
Order By: Relevance
“…Finally, by modeling the label relations and ensuring consistency between visual predictions and semantic relations, our approach relates to work in transfer learning [31,30], zero-shot learning [28,12,25], and attribute-based recognition [1,36,33,14], especially those that use semantic knowledge to improve recognition [16,30] and those that propagate or borrow annotations between categories [26,22].…”
Section: Related Workmentioning
confidence: 99%
“…Finally, by modeling the label relations and ensuring consistency between visual predictions and semantic relations, our approach relates to work in transfer learning [31,30], zero-shot learning [28,12,25], and attribute-based recognition [1,36,33,14], especially those that use semantic knowledge to improve recognition [16,30] and those that propagate or borrow annotations between categories [26,22].…”
Section: Related Workmentioning
confidence: 99%
“…More recent studies take advantages of an embedding approach as middle layers between low-level features and class labels [4], [7], [8], [9], [10], [11]. Besides, some novel works study how to directly construct classifiers for unseen classes [12], [13], [14]. The latter stream focuses on how to effectively represent human knowledge that can generalise to novel classes, such as human-nameable attributes [2], [15], [16], [17], [18], word vectors [3], [19], textual descriptions [20], and class similarities [21], [22].…”
Section: Introductionmentioning
confidence: 99%
“…Such issues make using attributes impractical. As a low-cost solution, text-based semantic features is proposed [32,10,37,26]. However, the textual description from the Internet can be noisy and not directly related to the visual appearance.…”
Section: Related Workmentioning
confidence: 99%