2014
DOI: 10.1007/978-3-319-08404-6_12

Approximately Minwise Independence with Twisted Tabulation

Abstract: A random hash function $h$ is $\varepsilon$-minwise if for any set $S$, $|S| = n$, and element $x \in S$, $\Pr[h(x) = \min h(S)] = (1 \pm \varepsilon)/n$. Minwise hash functions with low bias $\varepsilon$ have widespread applications within similarity estimation. Hashing from a universe $[u]$, the twisted tabulation hashing of Pǎtraşcu and Thorup [SODA'13] makes $c = O(1)$ lookups in tables of size $u^{1/c}$. Twisted tabulation was invented to get good concentration for hashing-based sampling. Here we show that twisted tabulation yields $O(1/u^{1/c})$-minwise hashing. In …
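The lookup scheme summarized in the abstract can be illustrated with a minimal Python sketch. It follows the commonly described form of twisted tabulation (tail characters index tables that return a hash value plus a "twister", and the twister is XORed into the head character before the final lookup). The class name `TwistedTabulation`, the helper `min_frequency`, the 8-bit characters, and the 32-bit output are assumptions made for illustration; none of them come from this page.

```python
import random


class TwistedTabulation:
    """Illustrative twisted tabulation hash over keys of c characters.

    Assumed layout (not taken from the source page): keys are integers
    below 2**(c * char_bits); character 0 is the head, the rest are tail.
    """

    def __init__(self, c=4, char_bits=8, out_bits=32, seed=0):
        rng = random.Random(seed)
        size = 1 << char_bits
        self.c = c
        self.char_bits = char_bits
        self.out_bits = out_bits
        self.char_mask = size - 1
        # Each tail entry packs out_bits of hash value plus char_bits of twister.
        self.tail = [
            [rng.getrandbits(out_bits + char_bits) for _ in range(size)]
            for _ in range(c - 1)
        ]
        # The head table is indexed by the twisted head character.
        self.head = [rng.getrandbits(out_bits) for _ in range(size)]

    def __call__(self, x):
        head_char = x & self.char_mask
        acc = 0
        for i in range(1, self.c):
            ch = (x >> (i * self.char_bits)) & self.char_mask
            acc ^= self.tail[i - 1][ch]
        twister = acc >> self.out_bits            # top char_bits of the XOR
        value = acc & ((1 << self.out_bits) - 1)  # low out_bits of the XOR
        return value ^ self.head[head_char ^ twister]


def min_frequency(keys, x, trials=500):
    """Fraction of independently seeded hash functions under which x is the minimum."""
    hits = 0
    for seed in range(trials):
        h = TwistedTabulation(seed=seed)
        if min(keys, key=h) == x:
            hits += 1
    return hits / trials
```

For a set of $n$ keys, `min_frequency` should land near $1/n$ if the hash behaves approximately minwise; this is only an empirical sanity check, not a substitute for the bound stated in the abstract.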

Citations: cited by 11 publications (9 citation statements)
References: 22 publications
“…Minwise independence Concerning minwise hashing, Dahlgaard and Thorup [15] have proved that twisted tabulation yields the following strengthening of Theorem 4 for simple tabulation.…”
Section: Theorem 5 ([42]) Choose a Random C-character Twisted Tabulat…
Citation type: mentioning
confidence: 96%
“…In this paper we will survey recent results from [12,13,14,15,41,42,50] showing how simple realistic hashing schemes based on tabulation provide unexpectedly strong guarantees for many popular randomized algorithms, e.g., linear probing, Cuckoo hashing, min-wise independence, treaps, planar partitions, power-of-two-choices, Chernoff-style concentration bounds, and even high independence. The survey is from a users perspective, explaining how these tabulation schemes can be applied.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…Min-wise independence unfortunately does not follow as a consequence of strong universality, though you can approximate it with sufficiently high degrees of k-independence [24] or by using a twisted variant of tabulation hashing [10].…”
Section: Hashing and Permutations
Citation type: mentioning
confidence: 99%
“…Generating m hash functions simply requires the generation of m sets of lookup tables. A more recent variation of simple tabulation hashing, named twisted tabulation hashing, is reported to be even better for minwise hashing [15], although it will not be described because it is not used in our current implementation. Table 1 shows simple tabulation minwise hashes for the bigram-tokenised strings 'john' and 'jon'.…”
Section: 10101011 01110101 01010011
Citation type: mentioning
confidence: 99%
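
The last statement above describes generating m hash functions from m sets of lookup tables and applying them to bigram-tokenised strings. A minimal sketch of that idea follows, assuming single-byte characters, one 256-entry table per bigram position, and 32-bit hash values. The function names are hypothetical, and the citing paper's actual implementation and its Table 1 values are not reproduced on this page.

```python
import random


def bigrams(s):
    """Set of overlapping two-character tokens of s."""
    return {s[i:i + 2] for i in range(len(s) - 1)}


def make_tables(m, positions=2, out_bits=32, seed=0):
    """m independent table sets; each set has one 256-entry table per character position."""
    rng = random.Random(seed)
    return [
        [[rng.getrandbits(out_bits) for _ in range(256)] for _ in range(positions)]
        for _ in range(m)
    ]


def simple_tabulation(tables, token):
    """XOR of one table lookup per character (assumes single-byte characters)."""
    h = 0
    for table, byte in zip(tables, token.encode("latin-1")):
        h ^= table[byte]
    return h


def minhash_signature(all_tables, tokens):
    """One minimum hash value per table set."""
    return [min(simple_tabulation(t, tok) for tok in tokens) for t in all_tables]


def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature positions on which the two minima agree."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


tables = make_tables(m=400)
sig_john = minhash_signature(tables, bigrams("john"))
sig_jon = minhash_signature(tables, bigrams("jon"))
similarity = estimate_jaccard(sig_john, sig_jon)
```

The bigram sets here are {jo, oh, hn} and {jo, on}, so their exact Jaccard similarity is 1/4, and `similarity` should fall close to that value when m is large.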