KATCH: high-coverage testing of software patches

Proceedings of the 2014 International Symposium on Software Testing and Analysis

2014

Self Cite

Software repositories provide rich information about the construction and evolution of software systems. While static data that can be mined directly from version control systems has been extensively studied, dynamic metrics concerning the execution of the software have received much less attention, due to the inherent difficulty of running and monitoring a large number of software versions. In this paper, we present Covrig, a flexible infrastructure that can be used to run each version of a system in isolation and collect static and dynamic software metrics, using a lightweight virtual machine environment that can be deployed on a cluster of local or cloud machines. We use Covrig to conduct an empirical study examining how code and tests co-evolve in six popular open-source systems. We report the main characteristics of software patches, analyse the evolution of program and patch coverage, assess the impact of nondeterminism on the execution of test suites, and investigate whether the coverage of code containing bugs and bug fixes is higher than average.

Section: Related Work (mentioning)

confidence: 99%

Covrig: a framework for the analysis of code, test, and coverage evolution in real software

Marinescu

Hošek

Proceedings of the 2014 International Symposium on Software Testing and Analysis

2014

Self Cite

“…executes at least one patch statement; if such an input does not exist in the program's test suite, it could be generated using previous techniques such as KATCH [20].…”

Section: Overview (mentioning)

confidence: 99%

“…At the very minimum, each line of code affected by the patch should be covered by at least one test case. While this level of testing is still far from being achieved in practice [21], automatic techniques for enabling high-coverage patch testing are becoming more and more successful [1,17,20,29,31]. Many of these techniques are based on dynamic symbolic execution [7], a program analysis technique that provides the ability to generate inputs that form high-coverage test suites.…”

Section: Introduction (mentioning)

confidence: 99%
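The minimum bar stated in the excerpt above (every line affected by a patch is covered by at least one test case) can be illustrated with a short sketch. This is not code from any of the cited tools; the function name `patch_coverage` and the representation of lines as `(file, line_number)` pairs are assumptions made for illustration.

```python
def patch_coverage(patch_lines, covered_lines):
    """Return (fraction of patch lines covered, sorted uncovered patch lines).

    Both arguments are iterables of (file, line_number) pairs. This
    representation is hypothetical and chosen only for the example.
    """
    patch = set(patch_lines)
    if not patch:
        # An empty patch has nothing left to cover.
        return 1.0, []
    uncovered = sorted(patch - set(covered_lines))
    return 1 - len(uncovered) / len(patch), uncovered


# Lines touched by a hypothetical patch, and lines executed by the test suite.
patch = [("a.c", 1), ("a.c", 2), ("b.c", 5), ("b.c", 6)]
covered = [("a.c", 1), ("b.c", 5), ("b.c", 7)]

ratio, missing = patch_coverage(patch, covered)
print(ratio, missing)
```

Any line reported as uncovered is a candidate target for a test-generation technique of the kind the excerpt mentions.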

Shadow of a doubt

Palikareva

Kuchta

Proceedings of the 38th International Conference on Software Engineering

2016

Self Cite

While developers are aware of the importance of comprehensively testing patches, the large effort involved in coming up with relevant test cases means that such testing rarely happens in practice. Furthermore, even when test cases are written to cover the patch, they often exercise the same behaviour in the old and the new version of the code. In this paper, we present a symbolic execution-based technique that is designed to generate test inputs that cover the new program behaviours introduced by a patch. The technique works by executing both the old and the new version in the same symbolic execution instance, with the old version shadowing the new one. During this combined shadow execution, whenever a branch point is reached where the old and the new version diverge, we generate a test case exercising the divergence and comprehensively test the new behaviours of the new version. We evaluate our technique on the Coreutils patches from the CoREBench suite of regression bugs, and show that it is able to generate test inputs that exercise newly added behaviours and expose some of the regression bugs.
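The divergence idea in the abstract above can be shown with a toy sketch. It is not the authors' implementation: it evaluates concrete inputs by brute force instead of using symbolic execution with a constraint solver, and the names `change`, `shadow_run`, `unified` and `divergent_inputs` are hypothetical.

```python
class Divergence(Exception):
    """Raised when the old and new versions take different branch directions."""


def shadow_run(program, x):
    """Run a unified old/new program on input x.

    Returns the label of the first branch where the versions diverge,
    or None if both versions follow the same path.
    """
    def change(label, old, new):
        # Evaluate the old and new branch conditions side by side.
        if bool(old) != bool(new):
            raise Divergence(label)
        return new

    try:
        program(x, change)
    except Divergence as d:
        return d.args[0]
    return None


def unified(x, change):
    # Hypothetical patch: the old condition was x > 0, the new one is x >= 0.
    if change("sign-check", x > 0, x >= 0):
        return "non-negative"
    return "negative"


def divergent_inputs(program, candidates):
    """Map each diverging branch label to the first candidate input that exposes it."""
    found = {}
    for x in candidates:
        label = shadow_run(program, x)
        if label is not None and label not in found:
            found[label] = x
    return found


print(divergent_inputs(unified, range(-3, 4)))
```

In this sketch only `x == 0` makes the two conditions disagree, so it is the input worth keeping as a test case; inputs on which both versions behave identically exercise no new behaviour.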

“…Unfortunately, often many of these changes themselves give rise to a significant amount of program bugs: new code is by definition little tested in the field, and often introduces bugs that affect both old and new functionality [10,18]. Even relatively simple, self-contained patches which are meant to fix specific bugs often introduce new errors [10,28].…”

Section: Introduction (mentioning)

confidence: 99%

“…For example, KATCH [18] uses a simple textual differencing algorithm to find out what statements have been added or changed, while DiSE [22] employs static program slicing to determine the statements affected by the patch. While this can lead to significant savings, static analysis of the program differences is often imprecise, and can miss important pruning and prioritisation opportunities, particularly those which exploit dynamic value information.…”

Section: Introduction (mentioning)

confidence: 99%
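The excerpt above says that KATCH uses a simple textual differencing algorithm to find added or changed statements. A minimal sketch of that general idea, using Python's standard `difflib` module, follows. It is not KATCH's actual code; the function name `changed_lines` and the sample sources are assumptions made for illustration.

```python
import difflib


def changed_lines(old_src, new_src):
    """Return 1-based line numbers in new_src that were added or changed."""
    old = old_src.splitlines()
    new = new_src.splitlines()
    matcher = difflib.SequenceMatcher(a=old, b=new, autojunk=False)
    changed = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "replace" and "insert" opcodes describe lines present in the new version.
        if tag in ("replace", "insert"):
            changed.extend(range(j1 + 1, j2 + 1))
    return changed


old_version = "int f(int x) {\n  return x;\n}\n"
new_version = "int f(int x) {\n  if (x < 0)\n    return 0;\n  return x;\n}\n"

print(changed_lines(old_version, new_version))
```

A purely textual diff of this kind only reports which lines differ. As the excerpt notes, it cannot tell which other statements are affected by the change, nor use dynamic value information.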

Shadow symbolic execution for better testing of evolving software

Companion Proceedings of the 36th International Conference on Software Engineering

Palikareva

2014

Self Cite

In this idea paper, we propose a novel way for improving the testing of program changes via symbolic execution. At a high level, our technique runs two different program versions in the same symbolic execution instance, with the old version effectively shadowing the new one. In this way, the technique can exploit precise dynamic value information to effectively drive execution toward the behaviour that has changed from one version to the next. We discuss the main challenges and opportunities of this approach in terms of pruning and prioritising path exploration, mapping elements across versions, and sharing common symbolic state between versions.