2014 Tenth European Dependable Computing Conference 2014
DOI: 10.1109/edcc.2014.27
|View full text |Cite
|
Sign up to set email alerts
|

Deviation Estimation between Distributed Data Streams

Abstract: Abstract-The analysis of massive data streams is fundamental in many monitoring applications. In particular, for networks operators, it is a recurrent and crucial issue to determine whether huge data streams, received at their monitored devices, are correlated or not as it may reveal the presence of malicious activities in the network system. We propose a metric, called codeviation, that allows to evaluate the correlation between distributed streams. This metric is inspired from classical metric in statistics … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…Most of the research done so far with this approach has mainly focused on computing functions or statistic measures with a given small error using a little amount of space with respect to the size of the input stream and the set from which items belong to. These include, as examples, the computation of the number of different data items in a given stream [19][20][21], the frequency moments [22], the most frequent data items [22,23], the entropy of the stream [24,25], or the proposition of a metric that allows to estimate a broad class of distance measures between any two massive streams [26], or the correlation among different streams [27].…”
Section: Related Workmentioning
confidence: 99%
“…Most of the research done so far with this approach has mainly focused on computing functions or statistic measures with a given small error using a little amount of space with respect to the size of the input stream and the set from which items belong to. These include, as examples, the computation of the number of different data items in a given stream [19][20][21], the frequency moments [22], the most frequent data items [22,23], the entropy of the stream [24,25], or the proposition of a metric that allows to estimate a broad class of distance measures between any two massive streams [26], or the correlation among different streams [27].…”
Section: Related Workmentioning
confidence: 99%
“…Most existing research using this approach has focused on calculating statistical functions or measurements with a given ε error, using a poly-logarithmic memory amount in the size of the stream and the domain of the incoming data. These functions estimate, for example, the number of distinct data present in a given stream [9], [23], [29], the frequency moments [2], the most frequent elements [2], [19], [40], [37], the entropy of a flow [17], [34], the co-variance between several streams [3] or the relative entropy between a biased stream and a uniform stream [7], [4].…”
Section: B the Value Of The Stream Modelmentioning
confidence: 99%