2020
DOI: 10.1007/s10664-020-09866-z
|View full text |Cite
|
Sign up to set email alerts
|

Using black-box performance models to detect performance regressions under varying workloads: an empirical study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
5
4

Relationship

2
7

Authors

Journals

citations
Cited by 17 publications
(11 citation statements)
references
References 43 publications
0
11
0
Order By: Relevance
“…However, for M o , we cannot directly calculate the prediction error, since applying a model to its training data leads to biased (overly optimized) results. To address this issue, we apply the throw-one approach that is used in prior research (Liao et al, 2020). For each time period in the original set of workloads, we remove its data from the training data to rebuild the model and apply the rebuilt model on the time period.…”
Section: Resultsmentioning
confidence: 99%
“…However, for M o , we cannot directly calculate the prediction error, since applying a model to its training data leads to biased (overly optimized) results. To address this issue, we apply the throw-one approach that is used in prior research (Liao et al, 2020). For each time period in the original set of workloads, we remove its data from the training data to rebuild the model and apply the rebuilt model on the time period.…”
Section: Resultsmentioning
confidence: 99%
“…Thus, we also use Cliff's delta to quantify the magnitude of the differences (a.k.a., effect sizes). Cliff's delta measures the effect size statistically and has been used in prior engineering studies (Kitchenham et al, 2002;Li, Chen, Shang, & Hassan, 2018;Liao et al, 2020).…”
Section: Statistical Analyses On Performance Evaluation Resultsmentioning
confidence: 99%
“…Logistic regression is a statistical model that uses a logit function to model a binary variable (the target variable) as a linear combination of the independent variables Hosmer Jr, Lemeshow, and Sturdivant (2013), which is widely used in software analytics (Shang et al, 2015;Tantithamthavorn et al, 2018). XGBoost is an efficient and accurate implementation of the gradient boosting algorithm, which is reported to perform better than other machine learning models in software engineering applications Liao et al (2020). The neural network model Glorot, Bordes, and Bengio (2011) used in our study consists of four layers and are trained with 100 batch size, and 10 epochs.…”
Section: Splited Configuration Option Namesmentioning
confidence: 99%
“…This complementary analysis is helpful to (i) demonstrate that DeLag is able to detect patterns correlated with latency deviations even on complex workloads, and (ii) to give a better idea to the reader about the capabilities of DeLag in supporting the analysis of specific latency behaviors. Similarly to recent studies [57], [58], we use load mixtures that involve multiple types of simulated users (i.e., load drivers), where each user type performs different classes of requests on the system. For example, in the Train Ticket case of study, some types of user may only visit the homepage and subsequently search trains for some random locations, while others may first login and then book random tickets.…”
Section: Methodsmentioning
confidence: 99%