Large Scale Online Learning of Image Similarity through Ranking

Chechik, Gal; Sharma, Varun; Shalit, Uri; Bengio, Samy

doi:10.1007/978-3-642-02172-5_2

Cited by 420 publications

(640 citation statements)

References 17 publications

Supporting

Mentioning

635

Contrasting

Unclassified

Order By: Relevance

“…Obtaining ground truth for training and testing retrieval algorithms is a challenging task. In the absence of feedback from real users of a retrieval system, alternative approaches have been proposed to obtain ground truth similarity data (e.g., [5]). In this paper we use semantic annotations available in the datasets to generate ground truth similarities, as will be described next.…”

Section: Methodsmentioning

confidence: 99%

“…4 The mixture model outperforms the global and transductive models in all cases. 5 To give a concrete example, in the known-database setting for the SUN data, for a recall of 20% the global and transductive models obtain around 7% precision versus 9% precision of the mixture model. This means that to obtain 20 of the top 100 neighbors, on average 286 images of the 6,000 database images need to be browsed with the global and transductive models.…”

Section: Comparison Of Modelsmentioning

confidence: 99%

“…These types of constraints could be obtained from feedback of users of the retrieval system. In the standard ranking SVMs [18,1,9,8,5], one assumes that a set of elementary pair-wise similarity functions is given and uses the triplets to learn an optimal weigthed combination of these functions. In this global ranking model the same weighted combination is used for all queries independently of where they lie in the query space.…”

Section: Introductionmentioning

confidence: 99%

“…Chechik et al [5] presented an online dual algorithm to find an optimal Z using ranking constraints. The loss function used in [5] is a generalization of the hinge loss for the ranking setting, which is the same loss function used in this paper. Finally, Lanckriet et al [13] proposed a method to learn a global linear combination of predefined kernel functions.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

A Latent Variable Ranking Model for Content-Based Retrieval

Carreras

Torralba

2012

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. Since their introduction, ranking SVM models [11] have become a powerful tool for training content-based retrieval systems. All we need for training a model are retrieval examples in the form of triplet constraints, i.e. examples specifying that relative to some query, a database item a should be ranked higher than database item b. These types of constraints could be obtained from feedback of users of the retrieval system. Most previous ranking models learn either a global combination of elementary similarity functions or a combination defined with respect to a single database item. Instead, we propose a "coarse to fine" ranking model where given a query we first compute a distribution over "coarse" classes and then use the linear combination that has been optimized for queries of that class. These coarse classes are hidden and need to be induced by the training algorithm. We propose a latent variable ranking model that induces both the latent classes and the weights of the linear combination for each class from ranking triplets. Our experiments over two large image datasets and a text retrieval dataset show the advantages of our model over learning a global combination as well as a combination for each test point (i.e. transductive setting). Furthermore, compared to the transductive approach our model has a clear computational advantages since it does not need to be retrained for each test query.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Comparison Of Modelsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

A Latent Variable Ranking Model for Content-Based Retrieval

Carreras

Torralba

2012

Lecture Notes in Computer Science

View full text Add to dashboard Cite

show abstract

“…The standard CBVR procedure involves three main components: (i) a query, containing a few video examples of the semantic concept that the user is looking for; (ii) a database, which is used to retrieve videos related to the query concept; and (iii) a ranking function, which sorts the database according to the relevance with respect to the user's query. These three components are typically integrated with the user in a Relevance Feedback (RF) scheme [5] to provide the most relevant videos through several feedback iterations. Figure 1 shows the general RF scheme for retrieval.…”

Section: Introductionmentioning

confidence: 99%

Latent topics-based relevance feedback for video retrieval

Fernández-Beltrán

Pla

2016

Pattern Recognition

View full text Add to dashboard Cite

This work presents a novel Content-Based Video Retrieval approach in order to cope with the semantic gap challenge by means of latent topics.Firstly, a supervised topic model is proposed to transform the classical retrieval approach into a class discovery problem. Subsequently, a new probabilistic ranking function is deduced from that model to tackle the semantic gap between low-level features and high-level concepts. Finally, a shortterm relevance feedback scheme is defined where queries can be initialised with samples from inside or outside the database. Several retrieval simulations have been carried out using three databases and seven different ranking functions to test the performance of the presented approach. Experiments revealed that the proposed ranking function is able to provide a competitive advantage within the content-based retrieval field.

show abstract

PSI: A probabilistic semantic interpretable framework for fine‐grained image ranking

et al. 2018

Asso for Info Science & Tech

View full text Add to dashboard Cite

Image Ranking is one of the key problems in information science research area. However, most current methods focus on increasing the performance, leaving the semantic gap problem, which refers to the learned ranking models are hard to be understood, remaining intact. Therefore, in this article, we aim at learning an interpretable ranking model to tackle the semantic gap in fine‐grained image ranking. We propose to combine attribute‐based representation and online passive‐aggressive (PA) learning based ranking models to achieve this goal. Besides, considering the highly localized instances in fine‐grained image ranking, we introduce a supervised constrained clustering method to gather class‐balanced training instances for local PA‐based models, and incorporate the learned local models into a unified probabilistic framework. Extensive experiments on the benchmark demonstrate that the proposed framework outperforms state‐of‐the‐art methods in terms of accuracy and speed.

show abstract

Large Scale Online Learning of Image Similarity through Ranking

Cited by 420 publications

References 17 publications

A Latent Variable Ranking Model for Content-Based Retrieval

A Latent Variable Ranking Model for Content-Based Retrieval

Latent topics-based relevance feedback for video retrieval

PSI: A probabilistic semantic interpretable framework for fine‐grained image ranking

Contact Info

Product

Resources

About