TANDEM: A Taxonomy and a Dataset of Real-World Performance Bugs

Sánchez, Ana María Sánchez; Delgado‐Pérez, Pedro; Medina‐Bulo, Inmaculada; Segura, Sergio

doi:10.1109/access.2020.3000928

Cited by 12 publications

(8 citation statements)

References 80 publications

(71 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Chen et al [16] surveyed and semantically categorized more than 700 performance bugs from real-world developer commits from 13 popular C/C++ projects. Sánchez et al [17] investigated the performance bugs across multiple publications in the research community and Tizpaz-Niari et al [18] surveyed performance bugs in machine learning libraries.…”

Section: Related Workmentioning

confidence: 99%

SlowCoach: Mutating Code to Simulate Performance Bugs

Chen

Schwahn

Natella

et al. 2022

2022 IEEE 33rd International Symposium on Software Reliability Engineering (ISSRE)

View full text Add to dashboard Cite

Performance bugs are unnecessarily inefficient code chunks in software codebases that cause prolonged execution times and degraded computational resource utilization. For performance bug diagnostics, tools that aid in the identification of said bugs, such as benchmarks and profilers, are commonly employed. However, due to factors such as insufficient workloads or ineffective benchmarks, software defects related to code inefficiencies are inherently difficult to diagnose. Hence, the capabilities of performance bug diagnostic tools are limited and performance bug instances may be missed. Traditional mutation testing (MT) is a technique for quantifying a test suite's ability to find functional bugs by mutating the code of the test subject. Similarly, we adopt performance mutation testing (PMT) to evaluate performance bug diagnostic tools and identify where improvements need to be made to a performance testing methodology. We carefully investigate the different performance bug fault models and how synthesized performance bugs based on these models can evaluate benchmarks and workload selection to help improve performance diagnostics. In this paper, we present the design of our PMT framework, SLOWCOACH, and evaluate it with over 1600 mutants from 4 real-world software projects.

show abstract

Section: Related Workmentioning

confidence: 99%

SlowCoach: Mutating Code to Simulate Performance Bugs

Chen

Schwahn

Natella

et al. 2022

2022 IEEE 33rd International Symposium on Software Reliability Engineering (ISSRE)

View full text Add to dashboard Cite

show abstract

“…What we can take away from this work is that: 1) some behaviours are undesirable but need not be defects, 2) some behaviours are trade-offs between one community and another, 3) program properties may be undesirable for developers but invisible to users, and 4) performance issues (including energy, execution time, and memory usage [33]) can be undesirable.…”

Section: A Bug Taxonomiesmentioning

confidence: 99%

“…These two categories can intersect if software quality is part of a program's specification. While practitioners and the literature do not agree on where to draw the line between bug and nonbug issues [24], the nomenclature of performance bugs [33] as well as the popularity of bug patterns for bad coding practices (Section II-B) are points in favour of treating ticks as a category of bugs.…”

Section: B Bugs Ticks and Defectsmentioning

confidence: 99%

Software Ticks Need No Specifications

Reichenbach

2021

2021 IEEE/ACM 43rd International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER)

View full text Add to dashboard Cite

Software bugs cost time, money, and lives. They drive software research and development efforts, and are central to modern software engineering. Yet we lack a clear and general definition of what bugs are. Some bugs are defects, clearly defined as failures to meet some requirement or specification. However, there are many forms of undesirable program behaviour that are completely compatible with a typical program's specification.In this paper, we argue that the lack of a criterion for identifying non-defect bugs is hampering the development of tools that find and fix bugs. We propose such a criterion, based on the idea of wasted effort, discuss how bugs that meet our definition of software ticks can complement defects, and sketch how our definition can help future work on software tools.

show abstract

“…These non-functional faults-unlike regular software bugs-do not cause the system to crash or exhibit any obvious misbehavior [76,85,99]. Instead, misconfigured systems remain operational but degrade in performance [16,71,75,86]. Misconfigurations caused major issues in cloud infrastructure [19], internetscale systems [14], and on-device machine learning systems [3].…”

Section: Introductionmentioning

confidence: 99%

Unicorn: Reasoning about Configurable System Performance through the lens of Causality

Shahriar¹,

Krishna²,

Javidian³

et al. 2022

Preprint

View full text Add to dashboard Cite

Modern computer systems are highly configurable, with the total variability space sometimes larger than the number of atoms in the universe. Understanding and reasoning about the performance behavior of highly configurable systems, due to a vast variability space, is challenging. State-of-theart methods for performance modeling and analyses rely on predictive machine learning models, therefore, they become (i) unreliable in unseen environments (e.g., different hardware, workloads), and (ii) produce incorrect explanations. To this end, we propose a new method, called Unicorn, which (i) captures intricate interactions between configuration options across the software-hardware stack and (ii) describes how such interactions impact performance variations via causal inference. We evaluated Unicorn on six highly configurable systems, including three on-device machine learning systems, a video encoder, a database management system, and a data analytics pipeline. The experimental results indicate that Unicorn outperforms state-of-the-art performance optimization and debugging methods. Furthermore, unlike the existing methods, the learned causal performance models reliably predict performance for new environments.1 we use non-functional and performance faults interchangeably for severe performance degradation that are caused by certain type of misconfigurations, aka specious configuration [49]. 1

show abstract

TANDEM: A Taxonomy and a Dataset of Real-World Performance Bugs

Cited by 12 publications

References 80 publications

SlowCoach: Mutating Code to Simulate Performance Bugs

SlowCoach: Mutating Code to Simulate Performance Bugs

Software Ticks Need No Specifications

Unicorn: Reasoning about Configurable System Performance through the lens of Causality

Contact Info

Product

Resources

About