A comparison of extended fingerprint hashing and locality sensitive hashing for binary audio fingerprints

Moravec, Kimberly; Cox, Ingemar J.

doi:10.1145/1991996.1992027

Cited by 7 publications

(3 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Substantial work has been conducted on database scalability issues, and various mechanisms are proposed. Among them, distributed key‐value stores exhibit great efficiency to store and retrieve large‐volume of data, and have been widely used in various applications such as content‐based image processing , bio‐info data mining , spatial data processing . There are several open‐source implementations available, including HBase , Cassandra and Voldemort .…”

Section: Introductionmentioning

confidence: 99%

“…We design a strict order‐preserving hash function to map the nearby objects in high‐dimensional spaces onto adjacent keys in key‐value stores using locality sensitive hashing (LSH) . LSH has been proven to be very efficient for many applications, where a transformation from a higher‐dimensional space to a lower‐dimensional space is needed.…”

Section: Introductionmentioning

confidence: 99%

“…Locality sensitive hash algorithm is an effective approach that has been widely used in large‐scale data management systems such as graph processing , data mining and information retrieval . The basic idea of LSH is to group objects with similar attributes in the high‐dimensional space into the same bucket.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

HDKV: supporting efficient high‐dimensional similarity search in key‐value stores

Zhou

Han

Zhang

et al. 2012

Concurrency and Computation

View full text Add to dashboard Cite

SUMMARYKey‐value stores are widely used on large‐scale data management in the cloud environment. However, they can only naturally support key‐based queries, and do not have efficient solutions for value‐based queries. Thus, dealing with high‐dimensional data in key‐value stores is still a big challenge. State‐of‐the‐art solutions apply value‐based tree‐structure indexes to solve this issue. These methods suffer from the curse of dimensionality and cannot achieve satisfactory performance. They also bring serious load unbalancing problem among servers, and result in dramatic system scalability degradation.Meanwhile, similarity search in high‐dimensional data space becomes more and more popular in today's cloud applications. Due to the lack of efficient algorithms for value‐based queries, users have to wait for a long time before the results are returned. To address this issue, we propose a novel approach called high‐dimensional similarity query in key‐value stores (HDKV), which can generate similarity results in a short time and maintain good database scalability. In HDKV, a strict order‐preserving hash function is designed to map nearby objects in the high‐dimensional space onto adjacent keys of a continuous linear space in key‐value stores. With this strategy, many expensive random accesses are replaced with more efficient scan accesses. The experimental evaluation on real world data set shows that compared to the state‐of‐the‐art methods, HDKV can dramatically reduce the search time with little impact on the accuracy. Copyright © 2012 John Wiley & Sons, Ltd.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%