2018 IEEE 38th International Conference on Distributed Computing Systems (ICDCS) 2018
DOI: 10.1109/icdcs.2018.00109
|View full text |Cite
|
Sign up to set email alerts
|

Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds

Abstract: The pervasive availability of streaming data is driving interest in distributed Fast Data platforms for streaming applications. Such latency-sensitive applications need to respond to dynamism in the input rates and task behavior using scale-in and -out on elastic Cloud resources. Platforms like Apache Storm do not provide robust capabilities for responding to such dynamism and for rapid task migration across VMs. We propose several dataflow checkpoint and migration approaches that allow a running streaming dat… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 24 publications
0
11
0
Order By: Relevance
“…In this case, response delay and task loss occur during real-time stream processing of data, which causes failure in task completion within the deadline. For resolving these issues, various scheduling schemes have been examined in which the loads on the worker nodes in a real-time stream environment are considered [10][11][12][13][14][15][16][17][18][19][20][21].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…In this case, response delay and task loss occur during real-time stream processing of data, which causes failure in task completion within the deadline. For resolving these issues, various scheduling schemes have been examined in which the loads on the worker nodes in a real-time stream environment are considered [10][11][12][13][14][15][16][17][18][19][20][21].…”
Section: Related Workmentioning
confidence: 99%
“…A study [16] proposed mechanisms to dynamically enact the rescheduling and migration of tasks in a streaming dataflow from one set of virtual machines to another reliably and rapidly. They proposed two task migration strategies such as Drain-Checkpoint-Restore(DCR) and Capture-Checkpoint-Resume(CCR) in Storm by using Redis that is a distributed key/value store.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Indeed, by letting multiple instances process the input stream in parallel, operators can efficiently handle load peaks, while avoiding resource wastage in low-load periods. However, operator scaling is particularly challenging, especially in presence of stateful operators, as each parallelism adaptation requires the execution of a reconfiguration protocol to preserve stream and state integrity, often causing significant overhead (see, e.g., [18]). As surveyed in [15], a variety of different techniques have been used to define operator scaling policies, including threshold-based heuristics [6], control theory [4], queueing theory [9], reinforcement learning [7].…”
Section: Related Workmentioning
confidence: 99%
“…The proposed model prioritizes the tasks sent by the federation scheduler with two multi-resource fair scheduling algorithms for cloud and federation. In addition, Shukla et al [34] proposed a mechanism for migrating running streaming dataflow across VMs. Tan et al [35] proposed a Cooperative Coevolution Genetic Programming (CCGP) hyper-heuristic approach.…”
Section: Related Workmentioning
confidence: 99%