2017 IEEE 33rd International Conference on Data Engineering (ICDE) 2017
DOI: 10.1109/icde.2017.81
|View full text |Cite
|
Sign up to set email alerts
|

Integrative Dynamic Reconfiguration in a Parallel Stream Processing Engine

Abstract: Load balancing, operator instance collocations and horizontal scaling are critical issues in Parallel Stream Processing Engines to achieve low data processing latency, optimized cluster utilization and minimized communication cost respectively. In previous work, these issues are typically tackled separately and independently. We argue that these problems are tightly coupled in the sense that they all need to determine the allocations of workloads and migrate computational states at runtime. Optimizing them ind… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 23 publications
(12 citation statements)
references
References 32 publications
0
12
0
Order By: Relevance
“…e authors use a tiered bin-packing problem, which allows for prioritizing operators, and solve the optimization problem using simulation-based latency estimation. Madsen et al [94] provide a solution to integrate load balancing, collocating (i.e., placement), and horizontal scaling (i.e., determining the parallelization degree of operators). ey model the problem as a Mixed-Integer Linear Program, solved with a heuristic greedy algorithm.…”
Section: Operator Placementmentioning
confidence: 99%
“…e authors use a tiered bin-packing problem, which allows for prioritizing operators, and solve the optimization problem using simulation-based latency estimation. Madsen et al [94] provide a solution to integrate load balancing, collocating (i.e., placement), and horizontal scaling (i.e., determining the parallelization degree of operators). ey model the problem as a Mixed-Integer Linear Program, solved with a heuristic greedy algorithm.…”
Section: Operator Placementmentioning
confidence: 99%
“…The works most closely related to ours have been presented in Heinze et al() and Madsen et al Heinze et al() propose a model to estimate latency spikes caused by operator reallocations and use it to define a placement heuristic. It results in a solution that places only the newly added operators and minimizes the latency violations.…”
Section: Related Workmentioning
confidence: 98%
“…Moreover, it represents a benchmark against which heuristics can be compared. Madsen et al formulate a MILP optimization problem aimed to control load balancing and horizontal scaling, which works in combination with a heuristic in charge of collocating operators on computing nodes. Similarly to our work, their solution considers operator collocation (in our case, it follows from minimizing the response time) and state‐migration overheads.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Thus far, state has been shown to be effective in several isolated application scenarios (i.e., fault tolerance, load balancing, elasticity). However, state can also be used to simultaneously address multiple scenarios simultaneously (e.g., scalability, fault tolerance [36,43,80,81,111]). It is in this scenario that multiobjective or integrative optimization (IO) arises.…”
Section: Integrative Optimizationmentioning
confidence: 99%