2017
DOI: 10.1007/s10618-017-0526-x
|View full text |Cite
|
Sign up to set email alerts
|

Fast and accurate mining of correlated heavy hitters

Abstract: The problem of mining Correlated Heavy Hitters (CHH) from a twodimensional data stream has been introduced recently, and a deterministic algorithm based on the use of the Misra-Gries algorithm has been proposed by Lahiri et al. to solve it. In this paper we present a new counter-based algorithm for tracking CHHs, formally prove its error bounds and correctness and show, through extensive experimental results, that our algorithm outperforms the Misra-Gries based algorithm with regard to accuracy and speed whils… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
11
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2
1

Relationship

3
5

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 43 publications
0
11
0
Order By: Relevance
“…Recent results on finding frequent items in streams include models where items that are more recent in the stream have higher weight [25,3], as well as many applications, e.g., finding frequent elements in two-dimensional data streams [20,8].…”
Section: Previous and Related Workmentioning
confidence: 99%
“…Recent results on finding frequent items in streams include models where items that are more recent in the stream have higher weight [25,3], as well as many applications, e.g., finding frequent elements in two-dimensional data streams [20,8].…”
Section: Previous and Related Workmentioning
confidence: 99%
“…Pairwise independent hash functions are used to map stream's items to corresponding cells in the sketch. Sketch-based algorithms include COUNTSKETCH by Charikar et al [5], GROUP TEST [22] and COUNT-MIN [23] by Cormode and Muthukrishnan, HCOUNT [24] by Jin et al and CMSS [11] by Cafaro et al Algorithms for Correlated Heavy Hitters (CHHs) have been recently proposed by Lahiri et al [25] and by Epicoco et al [26] in which a fast and more accurate algorithm for mining CHHs is presented.…”
Section: Related Workmentioning
confidence: 99%
“…Accelerator based algorithms include Govindaraju et al [42], Erra and Frola [43] and Cafaro et al [41,44]. Pulimeno et al [45] present a message-passing based version of the CHHs algorithm [26]; a parallel message-passing based version of [32] is presented in [46].…”
Section: Related Workmentioning
confidence: 99%
“…In [24], a faster and more accurate algorithm for mining CHHs is proposed. The Cascading Space-Saving Correlated Heavy Hitters (CSSCHH) algorithm exploits the basic ideas of Space-Saving, combining two summaries for tracking the primary item frequencies and the tuple frequencies.…”
Section: Related Workmentioning
confidence: 99%