2015
DOI: 10.1007/s00778-015-0382-5
|View full text |Cite
|
Sign up to set email alerts
|

Conditional heavy hitters: detecting interesting correlations in data streams

Abstract: The notion of heavy hitters-items that make up a large fraction of the population-has been successfully used in a variety of applications across sensor and RFID monitoring, network data analysis, event mining, and more. Yet this notion often fails to capture the semantics we desire when we observe data in the form of correlated pairs. Here, we are interested in items that are conditionally frequent: when a particular item is frequent within the context of its parent item. In this work, we introduce and formali… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(3 citation statements)
references
References 36 publications
0
3
0
Order By: Relevance
“…Algorithms for finding frequent items include counter-based approaches (Manku and Motwani, 2002;Demaine et al, 2002;Karp et al, 2003;Metwally et al, 2005), quantile algorithms (Greenwald and Khanna, 2001;Shrivastava et al, 2004), and sketch-based methods (Charikar et al, 2002;Cormode and Muthukrishnan, 2005b). Mirylenka et al (Mirylenka et al, 2015) develop streaming algorithms for finding conditional heavy hitters, i.e. items that are frequent in the context of a separate "parent" item.…”
Section: Related Workmentioning
confidence: 99%
“…Algorithms for finding frequent items include counter-based approaches (Manku and Motwani, 2002;Demaine et al, 2002;Karp et al, 2003;Metwally et al, 2005), quantile algorithms (Greenwald and Khanna, 2001;Shrivastava et al, 2004), and sketch-based methods (Charikar et al, 2002;Cormode and Muthukrishnan, 2005b). Mirylenka et al (Mirylenka et al, 2015) develop streaming algorithms for finding conditional heavy hitters, i.e. items that are frequent in the context of a separate "parent" item.…”
Section: Related Workmentioning
confidence: 99%
“…In [25], Mirylenka et al introduce the notion of Conditional Heavy Hitters and compare it with other related problems, such as association rules and Correlated Heavy Hitters, highlighting how solving these problems actually leads to different outputs, each emphasizing particular aspects of the input data stream. A group of algorithms is proposed and experimentally evaluated with respect to the approximate mining of Conditional Heavy Hitters.…”
Section: Related Workmentioning
confidence: 99%
“…Many management applications can benefit from a function that can find them efficiently, such as congestion control by dynamically scheduling elephant flows [2], network capacity planning [3], anomaly detection [4], and caching of forwarding table entries [5]. Such a function not only is important in networking measurements [6]- [15], but also has applications beyond networking in areas such as data mining [16]- [18], information retrieval [19], databases [20], and security [21].…”
Section: Introduction a Background And Motivationmentioning
confidence: 99%