On the Accuracy of Spectrum-based Fault Localization

Abreu, Rui; Zoeteweij, Peter; Gemund, Arjan J. C. van

doi:10.1109/taicpart.2007.4344104

Cited by 167 publications

(375 citation statements)

References 2 publications

Supporting

Mentioning: 373

Contrasting

Unclassified


“…In the previous literature, interface, type, and variable declarations are considered in the component ranking and the $C_d$ metric, although their likelihoods are in most cases 0 because of the limitations on the code instrumentation, which causes $\forall i: a_{ij} = 0$. This is especially true in Spectrum‐based techniques 9–11. As they are located at the bottom of the ranking, the number of inspected components (the numerator in the diagnostic effort formula used in Section 7.1) does not change.…”

Section: Results (mentioning)

confidence: 99%
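
The point made in the statement above can be illustrated with a minimal sketch. It assumes a binary activity matrix in which entry a_ij is 1 when test i executed component j, plus a pass/fail outcome per test, and it uses the Ochiai similarity coefficient as the scoring function. The function name and the sample data are hypothetical and are not taken from the cited papers.

```python
import math


def ochiai_scores(activity, failed):
    """Return one Ochiai score per component.

    activity[i][j] is 1 if test i executed component j, else 0.
    failed[i] is True if test i failed.
    """
    n_components = len(activity[0])
    total_failed = sum(failed)
    scores = []
    for j in range(n_components):
        # a_ef: failing tests that cover component j
        a_ef = sum(1 for row, f in zip(activity, failed) if row[j] and f)
        # a_ep: passing tests that cover component j
        a_ep = sum(1 for row, f in zip(activity, failed) if row[j] and not f)
        # Ochiai: a_ef / sqrt((a_ef + a_nf) * (a_ef + a_ep)),
        # where a_ef + a_nf equals the total number of failing tests.
        denom = math.sqrt(total_failed * (a_ef + a_ep))
        scores.append(a_ef / denom if denom else 0.0)
    return scores


# Hypothetical data: 4 tests, 4 components. Component 3 stands for a
# declaration that is never instrumented, so its column is all zeros.
activity = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 1, 0],
    [1, 1, 1, 0],
]
failed = [True, False, False, True]

scores = ochiai_scores(activity, failed)
# Components ordered from most to least suspicious.
ranking = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
```

In this sketch the never-covered component receives a score of 0 and lands at the end of `ranking`, so it does not add to the number of components inspected before a covered faulty component is reached.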

“…Although there is a large number of different diagnosis techniques (see Section 9), our work is based on Bayesian diagnosis, well known from Model‐Based Diagnosis, an area within AI. Compared to other, statistical approaches, such as Tarantula 11, Ochiai 9, and alternative techniques 19–25, Bayesian diagnosis is founded on probability theory, and is the only technique that can serve as the base for our test prioritization heuristic search function, described in Section 4.…”

Section: Fault Diagnosis (mentioning)

confidence: 99%

“…Automated fault‐localization techniques also aim at minimizing diagnostic cost when failures occur during the testing phase. Statistical approaches include the Tarantula tool by Jones et al 11, Ochiai by Abreu et al 9, the Nearest Neighbor technique by Renieris et al 22, Sober by Liu et al 21, CBI by Liblit and his colleagues 20, and CrossTab by Wang et al 25. Approaches to statistical fault localization need not be limited to the statement level, work that considers execution paths and dependencies includes 19, 23, 24, 43.…”

Section: Related Work (mentioning)

confidence: 99%

“…The debugging phase can make use of automatic fault localization techniques, which help to significantly reduce the manual debugging effort needed, as shown in 9–12. Fault localization algorithms use the information provided by tests executed in the testing phase to deduce a list of program elements (e.g.…”

Section: Introduction (mentioning)

confidence: 99%


Prioritizing tests for software fault diagnosis

et al. 2011

Self Cite


SUMMARY During regression testing, test prioritization techniques select test cases that maximize the confidence on the correctness of the system when the resources for quality assurance (QA) are limited. In the event of a test failing, the fault at the root of the failure has to be localized, adding an extra debugging cost that has to be taken into account as well. However, test suites that are prioritized for failure detection can reduce the amount of useful information for fault localization. This deteriorates the quality of the diagnosis provided, making the subsequent debugging phase more expensive, and defeating the purpose of the test cost minimization. In this paper we introduce a new test case prioritization approach that maximizes the improvement of the diagnostic information per test. Our approach minimizes the loss of diagnostic quality in the prioritized test suite. When considering QA cost as a combination of testing cost and debugging cost, on our benchmark set, the results of our test case prioritization approach show reductions of up to 60% of the overall combined cost of testing and debugging, compared with the next best technique. Copyright © 2011 John Wiley & Sons, Ltd.


“…Xie et al provided a theoretical evaluation of the original SBFL approaches investigated by Naish et al and approaches that have been derived by a genetic algorithm. However, most of these metrics have originally been evaluated and have subsequently been compared on relatively small‐scale artifacts from the publicly available Software‐artifact Infrastructure Repository (SIR). Only recently, some SBFL techniques have been evaluated on programs from the Defects4J benchmark…”

Section: Introduction (mentioning)

confidence: 99%

An evaluation of pure spectrum‐based fault localization techniques for large‐scale software systems

et al. 2019


Pure spectrum-based fault localization (SBFL) is a well-studied statistical debugging technique that only takes a set of test cases (some failing and some passing) and their code coverage as input and produces a ranked list of suspicious program elements to help the developer identify the location of a bug that causes a failed test case. Studies show that pure SBFL techniques produce good ranked lists for small programs. However, our previous study based on the iBugs benchmark that uses the AspectJ repository shows that, for realistic programs, the accuracy of the ranked list is not suitable for human developers. In this paper, we confirm this based on a combined empirical evaluation with the iBugs and the Defects4J benchmark. Our experiments show that, on average, at most ∼40%, ∼80%, and ∼90% of the bugs can be localized reliably within the first 10, 100, and 1000 ranked lines, respectively, in the Defects4J benchmark. To reliably localize 90% of the bugs with the best performing SBFL metric D*, ∼450 lines have to be inspected by the developer. For human developers, this remains unsuitable, although the results improve compared with the results for the AspectJ benchmark. Based on this study, we can clearly see the need to go beyond pure SBFL and take other information, such as information from the bug report or from version history of the code lines, into consideration.
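
To make the measures named in this abstract concrete, the following is a rough sketch of the D* suspiciousness metric and of the "fraction of bugs localized within the first n ranked lines" statistic. The exponent of 2 is an assumption (a commonly used setting; the abstract does not state one), the function names are hypothetical, and the ranks are made-up numbers, not results from the study.

```python
def dstar(a_ef, a_ep, a_nf, star=2):
    """D* suspiciousness of one program element.

    a_ef: failing tests that execute the element
    a_ep: passing tests that execute the element
    a_nf: failing tests that do not execute the element
    star: exponent (assumed to be 2 here)
    """
    denom = a_ep + a_nf
    if denom == 0:
        # Executed by every failing test and by no passing test:
        # treat as maximally suspicious (one possible convention).
        return float("inf") if a_ef > 0 else 0.0
    return a_ef ** star / denom


def fraction_within(ranks, n):
    """Share of bugs whose faulty line is ranked at position n or better."""
    return sum(1 for r in ranks if r <= n) / len(ranks)


# Hypothetical rank of the faulty line for six bugs (1 = top of the list).
faulty_line_ranks = [3, 8, 42, 97, 450, 1200]

for n in (10, 100, 1000):
    print(n, fraction_within(faulty_line_ranks, n))
```

The sketch ignores ties between equally suspicious lines; an evaluation would need an explicit tie-breaking rule (for example best, average, or worst rank).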


Isolating the Causes of Emergent Failures in Computer Software

Gore

2018

Emergent Behavior in Complex Systems Engineering

