2010
DOI: 10.1109/tpami.2008.283
|View full text |Cite
|
Sign up to set email alerts
|

Using Language to Learn Structured Appearance Models for Image Annotation

Abstract: Given an unstructured collection of captioned images of cluttered scenes featuring a variety of objects, our goal is to simultaneously learn the names and appearances of the objects. Only a small fraction of local features within any given image are associated with a particular caption word, and captions may contain irrelevant words not associated with any image object. We propose a novel algorithm that uses the repetition of feature neighborhoods across training images and a measure of correspondence with cap… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
44
0

Year Published

2010
2010
2017
2017

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 13 publications
(44 citation statements)
references
References 20 publications
0
44
0
Order By: Relevance
“…A number of researchers have studied the problem of automatic image annotation in recent years [2][3][4][5][6]1]. Given cluttered images of multiple objects paired with noisy captions, these systems can learn meaningful correspondences between caption words and appearance models.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…A number of researchers have studied the problem of automatic image annotation in recent years [2][3][4][5][6]1]. Given cluttered images of multiple objects paired with noisy captions, these systems can learn meaningful correspondences between caption words and appearance models.…”
Section: Related Workmentioning
confidence: 99%
“…In [1], we describe an automatic annotation system that can capture explicit spatial configurations of features while retaining the ability to learn from noisy, unstructured collections of captioned images. Guided by correspondence with caption words, the system iteratively constructs appearance graphs in which vertices represent local features and edges represent spatial relationships between them.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations