2015 IEEE International Parallel and Distributed Processing Symposium 2015
DOI: 10.1109/ipdps.2015.40
|View full text |Cite
|
Sign up to set email alerts
|

Online Top-k-Position Monitoring of Distributed Data Streams

Abstract: Consider n nodes connected to a single coordinator. Each node receives an individual online data stream of numbers and, at any point in time, the coordinator has to know the k nodes currently observing the largest values, for a given k between 1 and n. We design and analyze an algorithm that solves this problem while bounding the amount of messages exchanged between the nodes and the coordinator. Our algorithm employs the idea of using filters which, intuitively speaking, leads to few messages to be sent, if t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
1
1

Relationship

4
2

Authors

Journals

citations
Cited by 7 publications
(19 citation statements)
references
References 16 publications
0
19
0
Order By: Relevance
“…A message will usually, besides a constant number of control bits, consist of a data item, a node ID and an identifier to distinguish between messages of different instances of an algorithm applied in parallel (as done when using standard probability amplification techniques). A broadcast channel is an extension to [2], which was originally proposed in [13] and afterwards applied in [4,5,17]. Between any two time steps we allow a communication protocol to take place, which may use a polylogarithmic number of rounds.…”
Section: Modelmentioning
confidence: 99%
“…A message will usually, besides a constant number of control bits, consist of a data item, a node ID and an identifier to distinguish between messages of different instances of an algorithm applied in parallel (as done when using standard probability amplification techniques). A broadcast channel is an extension to [2], which was originally proposed in [13] and afterwards applied in [4,5,17]. Between any two time steps we allow a communication protocol to take place, which may use a polylogarithmic number of rounds.…”
Section: Modelmentioning
confidence: 99%
“…, ∆}, a node ID and an identifier to distinguish between messages of different instances of an algorithm applied in parallel (as done when using standard probability amplification techniques). Having a broadcast channel is an extension to [1], which was originally proposed in [2] and afterwards applied in [7,8]. For ease of presentation, we assume that not only the server can send broadcast messages, but also the nodes.…”
Section: Model and Problemsmentioning
confidence: 99%
“…Due to huge volumes and velocity, data can neither be completely stored, nor sent to a central server via a network, nor fully processed in real time. Initial results concern, among others, the communication complexity of socalled distributed aggregation problems; Mäcker et al [20] considered the expected message complexity for the top-k Position Monitoring problem. Here, the task is to compute the IDs of the devices that observe the k largest items at every time step.…”
Section: P3 Distributed Data Streams In Dynamic Environments F Meyermentioning
confidence: 99%