Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning

Demirel, Berkan; Cinbiş, Ramazan Gökberk; Ikizler-Cinbis, Nazli

doi:10.1109/iccv.2017.139

Cited by 52 publications

(46 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Lampert et al [17,11] show that using attributes provide convenient and cost effective knowledge transfer between seen and unseen classes. Demirel et al [10] use attribute information to learn visually more meaningful word representations. Unlike most other attribute based approaches, their method does not require the human supervised attribute-class relations at test time.…”

Section: Visual Features Visually Meaningful Word Representationmentioning

confidence: 99%

“…For example, semantically similar words, such "wolf" and "bear" are not particularly close in visual domain, whereas visually consistent words such as "mole" and "mouse" can be far apart in semantic word domain. In order to account for such differences, [10] propose to learn a transformation on the word vectors that allows ZSL by comparing the pooled embeddings of attribute names and class names. Below we provide only a brief summary of the image-based training formulation of this approach, a more through explanation can be found in [10].…”

Section: Visually Meaningful Vector Space Word Vectorsmentioning

confidence: 99%

“…In order to account for such differences, [10] propose to learn a transformation on the word vectors that allows ZSL by comparing the pooled embeddings of attribute names and class names. Below we provide only a brief summary of the image-based training formulation of this approach, a more through explanation can be found in [10].…”

Section: Visually Meaningful Vector Space Word Vectorsmentioning

confidence: 99%

“…For AwA and aPaY datasets, we utilize the CNN-M2K features [9], where images are resized to 256x256 and mean image subtraction is applied. Outputs of the last hidden layer are extracted for image representation, as also described in [10].…”

Section: Implementation Detailsmentioning

confidence: 99%

“…[4,5,6]) or attribute-based class specifications (e.g. [1,7,8,9,4,6,10]). Overall, the performance of a ZSL method heavily depends on the prior information as it is the primary factor determining the limits of cross-class knowledge sharing and transfer.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Learning Visually Consistent Label Embeddings for Zero-Shot Learning

Demirel

Cinbiş

Ikizler-Cinbis

2019

2019 IEEE International Conference on Image Processing (ICIP)

Self Cite

View full text Add to dashboard Cite

In this work, we propose a zero-shot learning method to effectively model knowledge transfer between classes via jointly learning visually consistent word vectors and label embedding model in an end-to-end manner. The main idea is to project the vector space word vectors of attributes and classes into the visual space such that word representations of semantically related classes become more closer, and use the projected vectors in the proposed embedding model to identify unseen classes. We evaluate the proposed approach on two benchmark datasets and the experimental results show that our method yields significant improvements in recognition accuracy.

show abstract

Section: Visual Features Visually Meaningful Word Representationmentioning

confidence: 99%

Section: Visually Meaningful Vector Space Word Vectorsmentioning

confidence: 99%

Section: Visually Meaningful Vector Space Word Vectorsmentioning

confidence: 99%

Section: Implementation Detailsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Learning Visually Consistent Label Embeddings for Zero-Shot Learning

Demirel

Cinbiş

Ikizler-Cinbis

2019

2019 IEEE International Conference on Image Processing (ICIP)

Self Cite

View full text Add to dashboard Cite

show abstract

Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition

Jiang

Wang

Shan

et al. 2018

Lecture Notes in Computer Science

106

View full text Add to dashboard Cite

Zero-shot learning (ZSL) aims to recognize objects of novel classes without any training samples of specific classes, which is achieved by exploiting the semantic information and auxiliary datasets. Recently most ZSL approaches focus on learning visual-semantic embeddings to transfer knowledge from the auxiliary datasets to the novel classes. However, few works study whether the semantic information is discriminative or not for the recognition task. To tackle such problem, we propose a coupled dictionary learning approach to align the visual-semantic structures using the class prototypes, where the discriminative information lying in the visual space is utilized to improve the less discriminative semantic space. Then, zero-shot recognition can be performed in different spaces by the simple nearest neighbor approach using the learned class prototypes. Extensive experiments on four benchmark datasets show the effectiveness of the proposed approach.

show abstract

Deep Multiple Instance Learning for Zero-Shot Image Tagging

Rahman

Khan

2019

Computer Vision – ACCV 2018

View full text Add to dashboard Cite

In-line with the success of deep learning on traditional recognition problem, several end-to-end deep models for zero-shot recognition have been proposed in the literature. These models are successful to predict a single unseen label given an input image, but does not scale to cases where multiple unseen objects are present. In this paper, we model this problem within the framework of Multiple Instance Learning (MIL). To the best of our knowledge, we propose the first end-to-end trainable deep MIL framework for the multi-label zero-shot tagging problem. Due to its novel design, the proposed framework has several interesting features: (1) Unlike previous deep MIL models, it does not use any off-line procedure (e.g., Selective Search or EdgeBoxes) for bag generation. (2) During test time, it can process any number of unseen labels given their semantic embedding vectors. (3) Using only seen labels per image as weak annotation, it can produce a bounding box for each predicted labels. We experiment with the NUS-WIDE dataset and achieve superior performance across conventional, zero-shot and generalized zero-shot tagging tasks.

show abstract

Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning

Cited by 52 publications

References 30 publications

Learning Visually Consistent Label Embeddings for Zero-Shot Learning

Learning Visually Consistent Label Embeddings for Zero-Shot Learning

Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition

Deep Multiple Instance Learning for Zero-Shot Image Tagging

Contact Info

Product

Resources

About