Proceedings of International Conference on Multimedia Retrieval 2014
DOI: 10.1145/2578726.2578728
|View full text |Cite
|
Sign up to set email alerts
|

A Cross-media Model for Automatic Image Annotation

Abstract: Automatic image annotation is still an important open problem in multimedia and computer vision. The success of media sharing websites has led to the availability of large collections of images tagged with human-provided labels. Many approaches previously proposed in the literature do not accurately capture the intricate dependencies between image content and annotations. We propose a learning procedure based on Kernel Canonical Correlation Analysis which finds a mapping between visual and textual words by pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
28
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 62 publications
(28 citation statements)
references
References 29 publications
0
28
0
Order By: Relevance
“…The MIRFLICKR-25K dataset [15] is composed of 25,000 images from Flickr with 1,386 user tags that occur in at least 20 images, and is split in 12,500 for training and 12,500 for testing, with exactly the same partition as [4,12]. In addition ground truth annotations for 18 tags are provided on the whole set.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The MIRFLICKR-25K dataset [15] is composed of 25,000 images from Flickr with 1,386 user tags that occur in at least 20 images, and is split in 12,500 for training and 12,500 for testing, with exactly the same partition as [4,12]. In addition ground truth annotations for 18 tags are provided on the whole set.…”
Section: Methodsmentioning
confidence: 99%
“…As low-level features are hardly semantically related, Guillaumin et al [12] and Verma et al [30] proposed to learn a weighted metric to improve on precision. Ballan et al [4] proposed using KCCA to learn mid-level features to be used with previous nearest neighbors approaches.…”
Section: Previous Workmentioning
confidence: 99%
See 1 more Smart Citation
“…While the number of tags that can be modeled is increasing, it remains unclear how to select a proper number of tags to annotate an unlabeled image. This problem is overlooked, because most of the existing works either assess the top-k ranked tags [3,4] or the entire tag ranking list [6].…”
Section: Related Workmentioning
confidence: 99%
“…Quite a few methods have been proposed for image annotation, either by building visual classifiers per tag [1,2] or by propagating tags from visually similar images [3,4]. Given a novel image and a tag vocabulary, these methods first compute each tag's relevance score with respect to the given image, and sort the tags in descending order by their scores.…”
Section: Introductionmentioning
confidence: 99%