2014
DOI: 10.1109/tip.2014.2330763
|View full text |Cite
|
Sign up to set email alerts
|

Coupled Binary Embedding for Large-Scale Image Retrieval

Abstract: Abstract-Visual matching is a crucial step in image retrieval based on the bag-of-words (BoW) model. In the baseline method, two keypoints are considered as a matching pair if their SIFT descriptors are quantized to the same visual word. However, the SIFT visual word has two limitations. First, it loses most of its discriminative power during quantization. Second, SIFT only describes the local texture feature. Both drawbacks impair the discriminative power of the BoW model and lead to false positive matches. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 134 publications
(6 citation statements)
references
References 61 publications
0
6
0
Order By: Relevance
“…Binary coding aims to present feature vectors by a compact binary code. For example, in [3], the authors present dual binary embedding applied lo large scale images retrieval. The system uses multiples binary features extracted from SIFT (Scale Invariant Feature Transform) feature and a multi-IDF (Inverse Document Frequency) scheme allowing the association of binary features to the inverted file.…”
Section: Large Scale Multimedia Management Method: Overviewmentioning
confidence: 99%
“…Binary coding aims to present feature vectors by a compact binary code. For example, in [3], the authors present dual binary embedding applied lo large scale images retrieval. The system uses multiples binary features extracted from SIFT (Scale Invariant Feature Transform) feature and a multi-IDF (Inverse Document Frequency) scheme allowing the association of binary features to the inverted file.…”
Section: Large Scale Multimedia Management Method: Overviewmentioning
confidence: 99%
“…In practice, additional data, such as other feature descriptors or cues, can be embedded in the inverted index (together with image IDs) for refined re-ranking, such as Hamming embedding [61], binary embedding [21], semantic-aware coindexing [57], and IR embedding [29]. The embedded information is typically compact (e.g.…”
Section: A Utilizing the Inverted Indexmentioning
confidence: 99%
“…The inverted index can be extended to multidimensional cases, aka inverted multi-index (IMI) [20]. If each index dimension corresponds to a different feature, then IMIlike structures can be used for feature fusion, aka coupled indexing [21]. For a two-dimensional IMI with dimensions M 1 and M 2 , the time complexity is reduced to O(N/M 1 /M 2 ), but the space complexity is increased to O(M 1 M 2 ).…”
Section: Introductionmentioning
confidence: 99%
“…In practice, SIFT and SURF are trade-off options because SIFT yields better accuracy results but slower inference. In 8,9 , they implemented a new approach when creating bag-of-words (BOW) for a large database; despite choosing only SIFT, they decided to add binary descriptions and achieved 79.6% accuracy on a holiday dataset. Deep Learning visual descriptor: Along with lowlevel features, the features extracted by these approaches above seem to exploit only a small amount of image information.…”
Section: Introductionmentioning
confidence: 99%