VEER: Enhancing the Interpretability of Model-based Optimizations

Peng, Kewen; Kaltenecker, Christian; Siegmund, Norbert; Apel, Sven

doi:10.48550/arxiv.2106.02716

Cited by 1 publication

(2 citation statements)

References 24 publications

“…Our study covers seven widely used machine learning models for learning software performance, i.e., Decision Tree (DT) [46] (used by [4,8,25,41]), 𝑘-Nearest Neighbours (𝑘NN) [21] (used by [35]), Kernel Ridge Regression (KRR) [52] (used by [35]), Linear Regression (LR) [23] (used by [4,8,49]), Neural Network (NN) [53] (used by [20,26]), Random Forest (RF) [30] (used by [45,50]), and Support Vector Regression (SVR) [17] (used by [4,50]), together with five popular real-world software systems from prior work [15,16,41,44], covering a wide spectrum of characteristics and domains. Naturally, the first research question (RQ) we ask is: RQ1: Is it practical to examine all encoding methods for finding the best one under every system?…”

Section: Research Questions (mentioning)

confidence: 99%

“…We shortlisted systems and their data from recent studies on software configuration tuning and modeling [41,44], from which we identified five systems and their environment according to the above criteria, as shown in Table 2. The five systems contain different percentages of categorical/binary and numeric configuration options while covering five distinct domains.…”

Section: System and Data Selection (mentioning)

confidence: 99%

Does Configuration Encoding Matter in Learning Software Performance? An Empirical Study on Encoding Schemes

Gong, Chen

2022

Preprint

Learning and predicting the performance of a configurable software system helps to provide better quality assurance. One important engineering decision therein is how to encode the configuration into the model built. Despite the presence of different encoding schemes, there is still little understanding of which is better and under what circumstances, as the community often relies on some general beliefs that inform the decision in an ad-hoc manner. To bridge this gap, in this paper, we empirically compared the widely used encoding schemes for software performance learning, namely label, scaled label, and one-hot encoding. The study covers five systems, seven models, and three encoding schemes, leading to 105 cases of investigation. Our key findings reveal that: (1) conducting trial-and-error to find the best encoding scheme in a case by case manner can be rather expensive, requiring up to 400+ hours on some models and systems; (2) the one-hot encoding often leads to the most accurate results while the scaled label encoding is generally weak on accuracy over different models; (3) conversely, the scaled label encoding tends to result in the fastest training time across the models/systems while the one-hot encoding is the slowest; (4) for all models studied, label and scaled label encoding often lead to relatively less biased outcomes between accuracy and training time, but the paired model varies according to the system. We discuss the actionable suggestions derived from our findings, hoping to provide a better understanding of this topic for the community. To promote open science, the data and code of this work can be publicly accessed at https://github.com/ideas-labo/MSR2022encoding-study.

CCS Concepts: Software and its engineering → Software performance.
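To make the three encoding schemes named in the abstract concrete, the following is a minimal sketch in plain Python. It is not taken from the study's repository. The option values (`off`, `lru`, `lfu`) and all function names are hypothetical, and "scaled label" is assumed here to mean label encoding rescaled to the range [0, 1].

```python
from typing import List, Sequence

# Hypothetical categorical configuration option and its possible values.
CATEGORIES = ["off", "lru", "lfu"]


def label_encode(values: Sequence[str], categories: Sequence[str]) -> List[int]:
    """Map each value to its integer index in `categories`."""
    index = {c: i for i, c in enumerate(categories)}
    return [index[v] for v in values]


def scaled_label_encode(values: Sequence[str], categories: Sequence[str]) -> List[float]:
    """Label encoding rescaled to [0, 1] (assumed meaning of 'scaled label')."""
    top = len(categories) - 1
    labels = label_encode(values, categories)
    return [label / top if top > 0 else 0.0 for label in labels]


def one_hot_encode(values: Sequence[str], categories: Sequence[str]) -> List[List[int]]:
    """Represent each value as a binary vector with a single 1."""
    index = {c: i for i, c in enumerate(categories)}
    rows = []
    for v in values:
        row = [0] * len(categories)
        row[index[v]] = 1
        rows.append(row)
    return rows


# Study size as stated in the abstract: systems x models x encoding schemes.
N_CASES = 5 * 7 * 3

if __name__ == "__main__":
    sample = ["lru", "off", "lfu"]
    print(label_encode(sample, CATEGORIES))
    print(scaled_label_encode(sample, CATEGORIES))
    print(one_hot_encode(sample, CATEGORIES))
    print(N_CASES)
```

One-hot encoding widens the input by one column per category, whereas the two label variants keep a single column per option. That difference in input width is one plausible reason the schemes differ in training time, though the abstract itself does not state the cause.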
