VLDB '02: Proceedings of the 28th International Conference on Very Large Databases 2002
DOI: 10.1016/b978-155860869-6/50037-8
|View full text |Cite
|
Sign up to set email alerts
|

Comparing Data Streams Using Hamming Norms (How to Zero In)

Abstract: Abstract-Massive data streams are now fundamental to many data processing applications. For example, Internet routers produce large scale diagnostic data streams. Such streams are rarely stored in traditional databases and instead must be processed "on the fly" as they are produced. Similarly, sensor networks produce multiple data streams of observations from their sensors. There is growing focus on manipulating data streams and, hence, there is a need to identify basic operations of interest in managing data … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
49
0

Year Published

2005
2005
2015
2015

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 53 publications
(49 citation statements)
references
References 16 publications
0
49
0
Order By: Relevance
“…In Section 2, we provide some backgrounds on the contemporary summary structures, namely the Count-Min sketches [10] and the l 0 -sketch [9]. We note that the sketches possess additivity and their space is independent of the number of vector components, which are important properties that we make use of in later sections.…”
Section: Organization Of the Papermentioning
confidence: 99%
See 1 more Smart Citation
“…In Section 2, we provide some backgrounds on the contemporary summary structures, namely the Count-Min sketches [10] and the l 0 -sketch [9]. We note that the sketches possess additivity and their space is independent of the number of vector components, which are important properties that we make use of in later sections.…”
Section: Organization Of the Papermentioning
confidence: 99%
“…When the magnitude of each component of a is bounded from above by some value U , we can compute its l 0 -sketch [9], denoted L 0 ( a), which allows us to approximate, a 0 , the number of non-zero entries in a. …”
Section: -Sketchmentioning
confidence: 99%
“…We estimate the Hamming norm L 0 = |{i: C i = 0}| [4], the number of distinct keys in a stream with deletions, as follows. We use the structure of Kane et al [19], which provides an -approximation L 0 with 2/3 success probability in both strict and non-strict streams.…”
Section: Estimationmentioning
confidence: 99%
“…Nowadays, a significant number of applications require the manipulation of data streams [6,2,7,17,8,10]. Examples of these applications are online stock analysis, computer network monitoring, network traffic management, earthquake prediction.…”
Section: Introductionmentioning
confidence: 99%