The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding

Patterson, Geneviève; Xu, Chen; Su, Hang; Hays, James

doi:10.1007/s11263-013-0695-z

Cited by 367 publications

(254 citation statements)

References 46 publications

Supporting

Mentioning: 249

Contrasting


“…We use a publicly available SUN Attribute dataset (SUNAttribute) [20] that comes with the averaged score over MTurk annotations of attribute being present in the image. In the second experiment, we focus on differentiating between 'easy' and 'hard' images of animal classes.…”

Section: Methods (mentioning)

confidence: 99%

“…Fast forward to 2008 and beyond, the needs for learning with multiple noisy annotations are further exemplified by the advent of crowdsourcing platforms [30,24,37,1,2,15]. Prior work ranges from a simple majority voting where all annotators are weighted equally [20,26] to a weighted voting by quantifying the expertise of the annotators [8]. Work that actively selects both the informative instances and the high-quality annotators also exists [25,15].…”

Section: Related Work (mentioning)

confidence: 99%

“…With MTurk, now it becomes possible to collect annotations for large datasets such as ImageNet [26], TinyImages [31], COCO [14], and Places [38]. Moreover, it becomes prevalent to collect task-specific datasets, for example for studying the attributes and their strength [20] and for determining the easiness or hardness of a particular classification task [22]. Those task-specific datasets often require annotations that are more ambiguous than typical object annotations 'present' or 'not present'.…”

Section: Introduction (mentioning)

confidence: 99%

“…A confidence in the label annotations, ranging from a simple percentage agreement to the kappa statistic, is then computed from the multiple annotations at each data point. For sufficiently high confidence annotations, a majority voting scheme is then widely adopted as the ground-truth label [31,14,38,20,22,26].…”

Section: Introduction (mentioning)

confidence: 99%


Ambiguity Helps: Classification with Disagreements in Crowdsourced Annotations

Sharmanska, Hernández-Lobato, et al. 2016

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
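
The excerpts above mention percentage agreement and majority voting over multiple crowdsourced annotations. As a rough illustration only, the following Python sketch shows one way these two quantities could be computed. The function names, the `min_agreement` threshold, and the sample votes are hypothetical and are not taken from the cited papers; the kappa statistic is not covered here.

```python
from collections import Counter


def percent_agreement(votes):
    """Share of annotators who chose the most common label."""
    top_count = Counter(votes).most_common(1)[0][1]
    return top_count / len(votes)


def majority_label(votes, min_agreement=0.5):
    """Most common label, or None if agreement is not above min_agreement.

    min_agreement is a hypothetical threshold, not a value from the quoted paper.
    """
    label, top_count = Counter(votes).most_common(1)[0]
    if top_count / len(votes) <= min_agreement:
        return None  # too ambiguous (covers two-way ties at the 0.5 default)
    return label


# Hypothetical annotator votes on whether an attribute is present (1) or not (0).
annotations = {
    "img_a": [1, 1, 1],
    "img_b": [1, 0, 1],
    "img_c": [0, 1, 0, 1],
}
labels = {name: majority_label(v) for name, v in annotations.items()}
```

With these sample votes, `img_a` and `img_b` receive the label 1, while the evenly split `img_c` is left unlabeled (`None`) rather than being forced to a ground-truth value.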


“…Many object tags are available in ImageNet [2], 22K classes resulting in 40K tags, since some classes come with multiple tags, like sea cow, sirenian mammal, sirenian represent one object class. The SUN Attribute dataset [15] has the highest number of scene tags so far, 717 classes like amusement park, coast, squash court. Tags from fine-grained animal categories are available in the 120 dog tag from Stanford Dogs [5] like pekinese, irish terrier, chihuahua, and 200 bird tags from Caltech Birds [22] cowbird, bobolink, blue jay.…”

Section: What Tags Constitute the Long Tail? (mentioning)

confidence: 99%

Exploring the Long Tail of Social Media Tags

Kordumova, Gemert, Snoek 2016

MultiMedia Modeling


Abstract. There are millions of users who tag multimedia content, generating a large vocabulary of tags. Some tags are frequent, while other tags are rarely used following a long tail distribution. For frequent tags, most of the multimedia methods that aim to automatically understand audio-visual content, give excellent results. It is not clear, however, how these methods will perform on rare tags. In this paper we investigate what social tags constitute the long tail and how they perform on two multimedia retrieval scenarios, tag relevance and detector learning. We show common valuable tags within the long tail, and by augmenting them with semantic knowledge, the performance of tag relevance and detector learning improves substantially.


Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition

Jiang, Wang, Shan, et al. 2018

Lecture Notes in Computer Science


Zero-shot learning (ZSL) aims to recognize objects of novel classes without any training samples of specific classes, which is achieved by exploiting the semantic information and auxiliary datasets. Recently most ZSL approaches focus on learning visual-semantic embeddings to transfer knowledge from the auxiliary datasets to the novel classes. However, few works study whether the semantic information is discriminative or not for the recognition task. To tackle such problem, we propose a coupled dictionary learning approach to align the visual-semantic structures using the class prototypes, where the discriminative information lying in the visual space is utilized to improve the less discriminative semantic space. Then, zero-shot recognition can be performed in different spaces by the simple nearest neighbor approach using the learned class prototypes. Extensive experiments on four benchmark datasets show the effectiveness of the proposed approach.
