Big Data and IoT applications require highly scalable database management systems (DBMSs), preferably operated in the cloud to ensure scalability at the resource level as well. As the number of existing distributed DBMSs is extensive, selecting and operating a distributed DBMS in the cloud is a challenging task. While DBMS benchmarking is a supportive approach, existing frameworks do not cope with the runtime constraints of distributed DBMSs and the volatility of cloud environments. Hence, DBMS evaluation frameworks need to consider DBMS runtime and cloud resource constraints to enable portable and reproducible results. In this paper we present Mowgli, a novel evaluation framework that enables the evaluation of non-functional DBMS features in correlation with DBMS runtime and cloud resource constraints. Mowgli fully automates the execution of cloud- and DBMS-agnostic evaluation scenarios, including DBMS cluster adaptations. The evaluation of Mowgli is based on two IoT-driven scenarios, comprising the DBMSs Apache Cassandra and Couchbase, nine DBMS runtime configurations, and two cloud providers with two different storage backends. Mowgli automates the execution of the resulting 102 evaluation scenarios, verifying its support for portable and reproducible DBMS evaluations. The results provide extensive insights into DBMS scalability and the impact of different cloud resources. The significance of the results is validated by correlation with existing DBMS evaluation results.

Since the era of RDBMSs, DBMS selection has been guided by domain-specific benchmarks, which have evolved together with distributed DBMSs.