Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds

Shukla, Aparna; Simmhan, Yogesh

doi:10.1109/icdcs.2018.00109

Cited by 18 publications

(11 citation statements)

References 24 publications

“…In this case, response delay and task loss occur during real-time stream processing of data, which causes failure in task completion within the deadline. For resolving these issues, various scheduling schemes have been examined in which the loads on the worker nodes in a real-time stream environment are considered [10][11][12][13][14][15][16][17][18][19][20][21].…”

Section: Related Work (mentioning)

confidence: 99%

“…A study [16] proposed mechanisms to dynamically enact the rescheduling and migration of tasks in a streaming dataflow from one set of virtual machines to another reliably and rapidly. They proposed two task migration strategies such as Drain-Checkpoint-Restore(DCR) and Capture-Checkpoint-Resume(CCR) in Storm by using Redis that is a distributed key/value store.…”

Section: Related Work (mentioning)

confidence: 99%

“…Various studies have been conducted for resolving this problem [10][11][12][13][14][15][16][17][18][19][20][21]. In some study, simply calculating the traffic cannot solve the problem of load imbalance on worker nodes.…”

Section: Introduction (mentioning)

confidence: 99%

Dynamic Task Scheduling Scheme for Processing Real-Time Stream Data in Storm Environments

et al. 2021

Owing to the recent advancements in Internet of Things technology, social media, and mobile devices, real-time stream balancing processing systems are commonly used to process vast amounts of data generated in various media. In this paper, we propose a dynamic task scheduling scheme considering task deadlines and node resources. The proposed scheme performs dynamic scheduling using a heterogeneous cluster consisting of various nodes with different performances. Additionally, the loads of the nodes considering the task deadlines are balanced by different task scheduling based on three defined load types. Based on diverse performance evaluations it is shown that the proposed scheme outperforms the conventional schemes.

“…Indeed, by letting multiple instances process the input stream in parallel, operators can efficiently handle load peaks, while avoiding resource wastage in low-load periods. However, operator scaling is particularly challenging, especially in presence of stateful operators, as each parallelism adaptation requires the execution of a reconfiguration protocol to preserve stream and state integrity, often causing significant overhead (see, e.g., [18]). As surveyed in [15], a variety of different techniques have been used to define operator scaling policies, including threshold-based heuristics [6], control theory [4], queueing theory [9], reinforcement learning [7].…”

Section: Related Work (mentioning)

confidence: 99%

Model-based auto-scaling of distributed data stream processing applications

Russo

2020

Proceedings of the 21st International Middleware Conference Doctoral Symposium

“…The proposed model prioritizes the tasks sent by the federation scheduler with two multi-resource fair scheduling algorithms for cloud and federation. In addition, Shukla et al [34] proposed a mechanism for migrating running streaming dataflow across VMs. Tan et al [35] proposed a Cooperative Coevolution Genetic Programming (CCGP) hyper-heuristic approach.…”

Section: Related Work (mentioning)

confidence: 99%

Minimizing Resource Waste in Heterogeneous Resource Allocation for Data Stream Processing on Clouds

Chung, Lee, et al. 2020

Applied Sciences

Resource allocation is vital for improving system performance in big data processing. The resource demand for various applications can be heterogeneous in cloud computing. Therefore, a resource gap occurs while some resource capacities are exhausted and other resource capacities on the same server are still available. This phenomenon is more apparent when the computing resources are more heterogeneous. Previous resource-allocation algorithms paid limited attention to this situation. When such an algorithm is applied to a server with heterogeneous resources, resource allocation may result in considerable resource wastage for the available but unused resources. To reduce resource wastage, a resource-allocation algorithm, called the minimizing resource gap (MRG) algorithm, for heterogeneous resources is proposed in this study. In MRG, the gap between resource usages for each server in cloud computing and the resource demands among various applications are considered. When an application is launched, MRG calculates resource usage and allocates resources to the server with the minimized usage gap to reduce the amount of available but unused resources. To demonstrate MRG performance, the MRG algorithm was implemented in Apache Spark. CPU- and memory-intensive applications were applied as benchmarks with different resource demands. Experimental results proved the superiority of the proposed MRG approach for improving the system utilization to reduce the overall completion time by up to 24.7% for heterogeneous servers in cloud computing.
