The Interplay of Sampling and Machine Learning for Software Performance Prediction

Kaltenecker, Christian; Grebhahn, Alexander; Siegmund, Norbert; Apel, Sven

doi:10.1109/ms.2020.2987024

Cited by 57 publications

(30 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Most existing techniques build global performanceinfluence models and treat the system as a black box, measuring the system's execution in an environment with a given workload for a subset of all configurations, and learning a model from these observations. The sampling (i.e., selecting which configurations to measure) and learning techniques used [15,17,35,36,49,61,[63][64][65] result in tradeoffs among the cost to build the models and the accuracy and interpretability of the models [15,35,38]. For example, larger samples are more expensive, but usually lead to more accurate models; random forests, with large enough samples, tend to learn more accurate models than those built with linear regression, but the models are harder to interpret when users want to understand performance or debug their systems [15,35,49] (see Fig.…”

Section: Introductionmentioning

confidence: 99%

“…• A replication package with subject systems, experimental setup, and data of several months of measurements [74]. There is substantial literature on modeling the performance of software systems [e.g., 15,35,38,75]. Performanceinfluence models solve a specific problem: Explaining how options and their interactions influence a system's performance for a given workload and environment, designed to help users understand performance and make deliberate configuration decisions.…”

Section: Introductionmentioning

confidence: 99%

“…Common learning techniques include linear regression [36,[63][64][65], regression trees [15,17,18,61], Fourier Learning [19], and neural networks [20]. Different sampling and learning techniques yield different tradeoffs between measurement effort, prediction accuracy, and interpretability of the learned models [15,35,38].…”

Section: Introductionmentioning

confidence: 99%

“…In these scenarios, the model's prediction accuracy is important across the entire configuration space, but understanding the structure of the model is not important. In this context, deep regression trees [17,18,61], Fourier Learning [19], and neural networks [20] are commonly used, which build accurate models, but are not easy to interpret by humans [15,35,38,49,63].…”

Section: Introductionmentioning

confidence: 99%

“…When performance-influence models are used by users to make deliberate configuration decisions [15,35,38,63,76,80] (e.g., whether to accept the performance overhead of encryption), interpretability regarding how options and interactions influence performance becomes paramount. In these settings, researchers usually suggest sparse linear models, such as 8 + 15A + 10C + 3AB + 30AC above, typically learned with stepwise linear regression or similar variations [36,[63][64][65].…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

White-Box Analysis over Machine Learning: Modeling Performance of Configurable Systems

Velez

Jamshidi

Siegmund

et al. 2021

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)

Self Cite

View full text Add to dashboard Cite

Performance-influence models can help stakeholders understand how and where configuration options and their interactions influence the performance of a system. With this understanding, stakeholders can debug performance behavior and make deliberate configuration decisions. Current black-box techniques to build such models combine various sampling and learning strategies, resulting in tradeoffs between measurement effort, accuracy, and interpretability. We present Comprex, a white-box approach to build performance-influence models for configurable systems, combining insights of local measurements, dynamic taint analysis to track options in the implementation, compositionality, and compression of the configuration space, without relying on machine learning to extrapolate incomplete samples. Our evaluation on 4 widely-used, open-source projects demonstrates that Comprex builds similarly accurate performance-influence models to the most accurate and expensive black-box approach, but at a reduced cost and with additional benefits from interpretable and local models.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

White-Box Analysis over Machine Learning: Modeling Performance of Configurable Systems

Velez

Jamshidi

Siegmund

et al. 2021

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)

Self Cite

View full text Add to dashboard Cite

show abstract

Multi-objective Parameter Tuning with Dynamic Compositional Surrogate Models

Pukhkaiev

Husak

Götz

et al. 2021

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Configuration Optimization with Limited Functional Impact

Guégain

Taherkordi

Quinton

2023

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Dealing with a large configuration space is a complex task for developers, especially when configurations must comply with both functional constraints and non-functional goals. In this paper, we introduce an approach to optimize any set of performance indicators for an existing configuration, while meeting functional requirements. The efficiency of this approach is assessed by exhaustively optimizing a configurable system, and by analyzing how the algorithm navigates through the configuration space. This approach proves especially efficient at optimizing configurations through a minimal number of changes, thus limiting the impact on their functional behavior.

show abstract

The Interplay of Sampling and Machine Learning for Software Performance Prediction

Cited by 57 publications

References 16 publications

White-Box Analysis over Machine Learning: Modeling Performance of Configurable Systems

White-Box Analysis over Machine Learning: Modeling Performance of Configurable Systems

Multi-objective Parameter Tuning with Dynamic Compositional Surrogate Models

Configuration Optimization with Limited Functional Impact

Contact Info

Product

Resources

About