2018
DOI: 10.1016/j.jpdc.2018.02.003
|View full text |Cite
|
Sign up to set email alerts
|

Model-driven scheduling for distributed stream processing systems

Abstract: Distributed Stream Processing frameworks are being commonly used with the evolution of Internet of Things(IoT). These frameworks are designed to adapt to the dynamic input message rate by scaling in/out.Apache Storm, originally developed by Twitter is a widely used stream processing engine while others includes Flink [8] Spark streaming [73]. For running the streaming applications successfully there is need to know the optimal resource requirement, as over-estimation of resources adds extra cost.So we need som… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
28
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 41 publications
(28 citation statements)
references
References 47 publications
0
28
0
Order By: Relevance
“…Without loss of generality, we assume that the tuple processing time of all the tasks is almost the same (this is a logical assumption used also by shuffle grouping, see [8], [14]). Under this hypothesis, we can divide the overall processing into a set of well-defined processing steps and stages, terms that are defined in the following subsection.…”
Section: A Preliminaries In Stormmentioning
confidence: 99%
See 2 more Smart Citations
“…Without loss of generality, we assume that the tuple processing time of all the tasks is almost the same (this is a logical assumption used also by shuffle grouping, see [8], [14]). Under this hypothesis, we can divide the overall processing into a set of well-defined processing steps and stages, terms that are defined in the following subsection.…”
Section: A Preliminaries In Stormmentioning
confidence: 99%
“…A dynamic scheme could adopt data parallelism and scale out the number of parallel instances for the operator that is overloaded and becomes a bottleneck and/or increase the number of VMs that run in the cluster [14]. Possible task migrations would be needed to reduce resource utilization imbalances between nodes.…”
Section: Bitmapmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, the latter drawback is strongest for multimedia data, for example, in applications that use video and image acquisition devices. For that reason, it is difficult to implement a centralized multimedia analysis system in the cloud [ 9 , 10 ].…”
Section: Introductionmentioning
confidence: 99%
“…A related but separate problem is to determine the new resource allocation for the dataflow (number and sizes of VMs) and the new mapping of its tasks onto the VMs. This is outside the scope of this paper, but has been examined elsewhere [4]. Having a new schedule is a precursor to the dynamic enactment of the schedule, which we target in the current paper.…”
Section: Introductionmentioning
confidence: 99%