A Queueing Network Model for Performance Prediction of Apache Cassandra

NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium

Buyya

Casale

2018

Self Cite

Apache Cassandra has emerged as one of the most widely adopted NoSQL databases. However, there is still a limited understanding on how to optimally operate Cassandra in the cloud using autoscaling methods, by which resources can be scaled up or down to reduce operational costs and meet servicelevel objectives (SLOs). To address this limitation, we present PAX, a partition-aware elastic resource management system for Apache Cassandra. PAX uses low-overhead query sampling and knowledge of the datapartitioning across the nodes to automatically adapt capacity in Cassandra clusters. Differently from existing autoscaling methods for Cassandra, which incur large acquisition times for new nodes, PAX exploits Cassandra's hinted handoff mechanism and a shared hints storage to minimize the time needed to acquire a node into the cluster. We propose a reactive and a proactive implementation of PAX and compare their performance against different workloads with varying intensities and item popularity distributions, finding that the proactive version significantly reduces SLO violations.

Section: A Workload Forecastingmentioning

confidence: 85%

PAX: Partition-aware autoscaling for the Cassandra NoSQL database

NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium

Buyya

Casale

2018

Self Cite

“…For example, a job that visits multiple times the same node, requesting different execution requirements at each visit, may be modelled by assuming that the job switches class in-between visits. Distributed NoSQL databases such as Apache Cassandra provide an example, in which class-switching can be used to express a workflow of execution through multiple nodes in order to retrieve the data needed by a query for its completion [DCS17].…”

Section: Discussionmentioning

confidence: 99%

Performance Modelling and Optimisation of NoSQL Database Systems

SIGMETRICS Perform. Eval. Rev.

2020

Self Cite

Salvatore Dipietro is a final-year PhD candidate in Computing at Imperial College London. His current research focus is on performance modelling and optimization of NoSQL database systems. His work is supported by HiPEDS centre for doctoral training, funded by EPSRC. Before the PhD, Salvatore completed his MRes in Advance computing at Imperial College London (2015) and his MSc in computer security and forensics at Bedfordshire University (2013). Earlier he gained his undergraduate degree in computing engineering at Politecnico di Torino (2012). Besides, he also has extensive professional work experience as Cloud and DevOps engineer. His interest lies in performance and optimization, capacity planning, distributed applications and network security.

“…The Jensen Shannon divergence is bounded between 0 ≤ D JS ≤ 1 In this section, we illustrate the Cassandra model used for evaluating the SD algorithm. This is a simplified version of the model presented in [5]. With the aim to reduce the model complexity and the system state space, we have developed a model able to support only the Consistency Level ONE.…”

Section: B Divergence Measuresmentioning

confidence: 99%

“…Each Cassandra node uses the processor sharing (PS) scheduling policy. All the other stations used in [5], such as the networks and disk queues, have been grouped in a single infinite queue (or infinite server), called 'Net' positioned right after the workload generator. In addition, the workload generator has also been modelled as an infinite server.…”

Section: B Divergence Measuresmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

SD: A Divergence-Based Estimation Method for Service Demands in Cloud Systems

2019 7th International Conference on Future Internet of Things and Cloud (FiCloud)

Casale

2019

Self Cite

Estimating performance models parameters of cloud systems presents several challenges due to the distributed nature of the applications, the chains of interactions of requests with architectural nodes, and the parallelism and coordination mechanisms implemented within these systems. In this work, we present a new inference algorithm for model parameters, called state divergence (SD) algorithm, to accurately estimate resource demands in a complex cloud application. Differently from existing approaches, SD attempts to minimize the divergence between observed and modeled marginal state probabilities for individual nodes within an application, therefore requiring the availability of probabilistic measures from both the system and the underpinning model. Validation against a case study using the Apache Cassandra NoSQL database and random experiments show that SD can accurately predict demands and improve system behavior modeling and prediction.