2009
DOI: 10.1007/978-3-642-02172-5_2
|View full text |Cite
|
Sign up to set email alerts
|

Large Scale Online Learning of Image Similarity through Ranking

Abstract: Learning a measure of similarity between pairs of objects is an important generic problem in machine learning. It is particularly useful in large scale applications like searching for an image that is similar to a given image or finding videos that are relevant to a given video. In these tasks, users look for objects that are not only visually similar but also semantically related to a given object. Unfortunately, the approaches that exist today for learning such semantic similarity do not scale to large datas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

4
635
0
1

Year Published

2012
2012
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 420 publications
(640 citation statements)
references
References 17 publications
4
635
0
1
Order By: Relevance
“…Obtaining ground truth for training and testing retrieval algorithms is a challenging task. In the absence of feedback from real users of a retrieval system, alternative approaches have been proposed to obtain ground truth similarity data (e.g., [5]). In this paper we use semantic annotations available in the datasets to generate ground truth similarities, as will be described next.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Obtaining ground truth for training and testing retrieval algorithms is a challenging task. In the absence of feedback from real users of a retrieval system, alternative approaches have been proposed to obtain ground truth similarity data (e.g., [5]). In this paper we use semantic annotations available in the datasets to generate ground truth similarities, as will be described next.…”
Section: Methodsmentioning
confidence: 99%
“…4 The mixture model outperforms the global and transductive models in all cases. 5 To give a concrete example, in the known-database setting for the SUN data, for a recall of 20% the global and transductive models obtain around 7% precision versus 9% precision of the mixture model. This means that to obtain 20 of the top 100 neighbors, on average 286 images of the 6,000 database images need to be browsed with the global and transductive models.…”
Section: Comparison Of Modelsmentioning
confidence: 99%
See 2 more Smart Citations
“…The standard CBVR procedure involves three main components: (i) a query, containing a few video examples of the semantic concept that the user is looking for; (ii) a database, which is used to retrieve videos related to the query concept; and (iii) a ranking function, which sorts the database according to the relevance with respect to the user's query. These three components are typically integrated with the user in a Relevance Feedback (RF) scheme [5] to provide the most relevant videos through several feedback iterations. Figure 1 shows the general RF scheme for retrieval.…”
Section: Introductionmentioning
confidence: 99%