Optimal operator deployment and replication for elastic distributed data stream processing

Cardellini, Valeria; Presti, Francesco Lo; Nardelli, Matteo; Russo, Gabriele Russo

doi:10.1002/cpe.4334

Cited by 47 publications

(68 citation statements)

References 43 publications

“…Bugra et al [1] proposed a system of auto-parallelization that dynamically adjusts the number of parallel channels to achieve the best performance based on changes in the workload. Marangozova-Martin et al [2] proposed multi-level elasticity in stream processing environments with low latency and minimum resources, and Cardellini et al [3] dealt with effective runtime management in terms of placement and replication decisions while considering the application and resource heterogeneity and the migration overhead, so to select the optimal adaptation strategy that can minimize migration costs while satisfying the application QoS requirements. These papers' objective was to achieve the elastic scalability for stream processing systems based on the individual machines or nodes.…”

Section: Related Work (mentioning)

confidence: 99%

Serverless Stream Processing with Elastic Multi-M/M/s/K Queue System

2019

IJEAT

The high throughput - low latency stream processing systems are required to be elastic enough to scale for varying load spike on-demand. However, in the current stream processing systems, the load shedding is observed which impacts the final accuracy. In order to get rid of this issue, the elasticity can be implemented in all kinds of resources involved in the stream processing systems. This paper focuses on providing the elastic scalability in queues and Serverless functions for the event stream processing systems. First, we explain the need of elastic multi-queue with Serverless function in detail for event stream processing, and then will propose an algorithm for elastic scalability of multi-M/M/s/K Queuing with Serverless functions for the efficient stream processing. The experiment result shows that the system scales very well in short span of time with the help of our proposed algorithm. The increased availability in turn helps improving the high processing throughput in low latency.

“…Most of the existing system architectures consider a centralized management solution, where a single coordination entity exploits its global knowledge about the entire system state to plan the proper adaptation actions (e.g., [6,7,[17][18][19][20][21]). Although this approach can potentially achieve a global optimum adaptation strategy, it may not be suitable for a wide-area distributed environment, because of the tight coupling among the system components and the fact that a central manager represents a bottleneck in a large-scale system due to monitoring and planning overheads.…”

Section: System Architectures (mentioning)

confidence: 99%

“…Other works (e.g., [7,[35][36][37][38]) use more complex centralized policies to determine the scaling decisions, exploiting optimization methods that rely on the knowledge of a global model, such as integer linear programming [7], control theory [35], queueing theory [36], and fuzzy logic [37]. In [7], we presented an integer linear programming problem for the run-time elasticity management of DSP applications that takes into account the application reconfiguration costs after scaling operations and aims to minimize them while satisfying the application performance requirements. Lohrmann et al [36] proposed a strategy that enforces latency constraints by relying on a predictive latency model based on queueing theory.…”

Section: Elasticity Policies (mentioning)

confidence: 99%

“…Most of the approaches proposed in the literature for managing DSP applications have been designed for cluster environments, where a single centralized control component takes deployment decisions by exploiting a global system view (e.g., [5][6][7]). These solutions typically do not scale well in a highly distributed environment, given the spatial distribution, heterogeneity, and sheer size of the infrastructure itself.…”

Section: Introduction (mentioning)

confidence: 99%

Multi-Level Elasticity for Wide-Area Data Streaming Systems: A Reinforcement Learning Approach

et al. 2018

Self Cite

The capability of efficiently processing the data streams emitted by nowadays ubiquitous sensing devices enables the development of new intelligent services. Data Stream Processing (DSP) applications allow for processing huge volumes of data in near real-time. To keep up with the high volume and velocity of data, these applications can elastically scale their execution on multiple computing resources to process the incoming data flow in parallel. Being that data sources and consumers are usually located at the network edges, nowadays the presence of geo-distributed computing resources represents an attractive environment for DSP. However, controlling the applications and the processing infrastructure in such wide-area environments represents a significant challenge. In this paper, we present a hierarchical solution for the autonomous control of elastic DSP applications and infrastructures. It consists of a two-layered hierarchical solution, where centralized components coordinate subordinated distributed managers, which, in turn, locally control the elastic adaptation of the application components and deployment regions. Exploiting this framework, we design several self-adaptation policies, including reinforcement learning based solutions. We show the benefits of the presented self-adaptation policies with respect to static provisioning solutions, and discuss the strengths of reinforcement learning based approaches, which learn from experience how to optimize the application performance and resource allocation.

“…A DSP application is commonly structured as a directed graph whose vertices are data sources, operators, and data sinks, whereas edges represent the data streams between operators. The application has one or multiple data sources that produce an input data stream, operators that perform transformations over the streaming data (e.g., filtering, aggregation, convolution) until the data reaches a data sink [3]. DSP applications are traditionally deployed on the Cloud in order to explore its virtually unlimited number of resources.…”

Section: Introduction (mentioning)

confidence: 99%

Multi-Objective Reinforcement Learning for Reconfiguring Data Stream Analytics on Edge Computing

Veith, Souza, Assunção, et al. 2019

Proceedings of the 48th International Conference on Parallel Processing

There is increasing demand for handling massive amounts of data in a timely manner via Distributed Stream Processing (DSP). A DSP application is often structured as a directed graph whose vertices are operators that perform transformations over the incoming data and edges representing the data streams between operators. DSP applications are traditionally deployed on the Cloud in order to explore the virtually unlimited number of resources. Edge computing has emerged as a suitable paradigm for executing parts of DSP applications by offloading certain operators from the Cloud and placing them close to where the data is generated, hence minimising the overall time required to process data events (i.e., the end-to-end latency). The operator reconfiguration consists of changing the initial placement by reassigning operators to different devices given target performance metrics. In this work, we model the operator reconfiguration as a Reinforcement Learning (RL) problem and define a multi-objective reward considering metrics regarding operator reconfiguration, and infrastructure and application improvement. Experimental results show that reconfiguration algorithms that minimise only end-to-end processing latency can have a substantial impact on WAN traffic and communication cost. The results also demonstrate that when reconfiguring operators, RL algorithms improve by over 50% the performance of the initial placement provided by state-of-the-art approaches.
