2014 Data Compression Conference 2014
DOI: 10.1109/dcc.2014.37
|View full text |Cite
|
Sign up to set email alerts
|

Compression Schemes for Similarity Queries

Abstract: We consider compression of sequences in a database so that similarity queries can be performed efficiently in the compressed domain. The fundamental limits for this problem setting, which characterize the tradeoff between compression rate and reliability of the answers to the queries, have been characterized in past work. However, how to approach these limits in practice has remained largely unexplored.Recently, we proposed a scheme for this task that is based on existing lossy compression algorithms, for the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…The TC-scheme is an improved version of the LC-scheme by optimizing jointly the quantization distortion and the expected query codeword distance. The results in [5] show that the compression rate of TC-can achieve the identification rate for the case with binary sources and the Hamming distance.…”
Section: Introductionmentioning
confidence: 92%
See 3 more Smart Citations
“…The TC-scheme is an improved version of the LC-scheme by optimizing jointly the quantization distortion and the expected query codeword distance. The results in [5] show that the compression rate of TC-can achieve the identification rate for the case with binary sources and the Hamming distance.…”
Section: Introductionmentioning
confidence: 92%
“…Proof. Given the stationary Gaussian source { Xn }, we can decompose the source into vectors X of M successive random variables and describe those vectors with a M th order multivariate Gaussian distribution (5). Then we can apply the KLT transform on the decomposed vectors X = A T M X, where A M is the eigenmatrix of the covariance matrix C M .…”
Section: Identification Rate Of Gaussian Sources With Memorymentioning
confidence: 99%
See 2 more Smart Citations
“…the TC-△ and the LC-△ schemes are optimal. However, if X and Y are not equiprobable (and the distortion measure is still Hamming), the LC-△ scheme differs from the TC-△ scheme (see [35,Fig. 2]).…”
Section: Special Casesmentioning
confidence: 99%