An approach and benchmark to detect behavioral changes of commits in continuous integration

Danglot, Benjamin; Monperrus, Martin; Rudametkin, Walter; Baudry, Benoît

doi:10.1007/s10664-019-09794-7

Cited by 15 publications

(7 citation statements)

References 32 publications

“…Because of the high computational cost of test generation, many tools have opted for integration into the continuous integration process (Arcuri et al 2016; Danglot et al 2020). This, however, leads to a long time distance between triggering the test generation and receiving results (Beller et al 2017), as well as the developers having to inspect the tools outside of their familiar development environment.…”

Section: Understandability Of Test Cases (mentioning)

confidence: 99%

Developer-centric test amplification

Brandt

Zaidman

2022

Empir Software Eng

Automatically generating test cases for software has been an active research topic for many years. While current tools can generate powerful regression or crash-reproducing test cases, these are often kept separately from the maintained test suite. In this paper, we leverage the developer’s familiarity with test cases amplified from existing, manually written developer tests. Starting from issues reported by developers in previous studies, we investigate what aspects are important to design a developer-centric test amplification approach, that provides test cases that are taken over by developers into their test suite. We conduct 16 semi-structured interviews with software developers supported by our prototypical designs of a developer-centric test amplification approach and a corresponding test exploration tool. We extend the test amplification tool DSpot, generating test cases that are easier to understand. Our IntelliJ plugin TestCube empowers developers to explore amplified test cases from their familiar environment. From our interviews, we gather 52 observations that we summarize into 23 result categories and give two key recommendations on how future tool designers can make their tools better suited for developer-centric test amplification.

“…[18,19] AMP_add is a special case of AMP_change, as its tools and techniques only focus on the changed parts of the project under test compared to its previous version (e.g., DCI). [20,21] On the other hand, AMP_exec will modify the test suites dependencies like the OS file system, libraries, databases, remote services, or access APIs to GPS or Bluetooth (e.g., CAMP). [22,23] Finally, AMP_mod techniques will try to make the test suite more precise by increasing the input exploration (e.g., TAUTOKO) [24] or regenerating the oracles (e.g., Orstra).…”

Section: Test Amplification (mentioning)

confidence: 99%

AmPyfier: Test amplification in Python

Schoofs

Abdi

Demeyer

2022

J Software Evolu Process

Test amplification aims to automatically improve a test suite. One technique generates new test methods through transformations of the original tests. These test amplification tools heavily rely on analysis techniques that benefit a lot from type declarations present in the source code of projects written in statically typed languages. In dynamically typed languages, such type declarations are not available, and therefore, research regarding test amplification for those languages is sparse. Recent work has brought test amplification to the dynamically typed language Pharo Smalltalk by introducing the concept of dynamic type profiling. The technique is dependent on Pharo‐specific frameworks and has not yet been generalized to other languages. Another significant downside in test amplification tools based on the mutation score of a test suite is their high time cost. In this paper, we present AmPyfier, a tool that brings test amplification and type profiling to the dynamically typed language Python. AmPyfier introduces multi‐metric selection in order to increase the time efficiency of test amplification. We evaluated AmPyfier on 11 open‐source projects and found that AmPyfier could strengthen 37 out of 54 test classes. Multi‐metric selection decreased the time cost ranging from 17% to 98% as opposed to selection based on the full mutation score.

“…Moreover, many studies used GumTree to analyze code patterns of changes such as bug-fixing changes [5], [13], [18], [21], [23], [29], logging changes [22] and changes to online code examples [34]. Also, prior work trained models based on the edit actions of changes that are calculated using GumTree [7], [15], [24], [31], [32]. Such models are used to recommend changes such as patches [32] and logging changes [22].…”

Section: B. Use Of AST Mapping Algorithms (mentioning)

confidence: 99%

A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms

Fan

Xia

et al. 2021

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)

Abstract syntax tree (AST) mapping algorithms are widely used to analyze changes in source code. Despite the foundational role of AST mapping algorithms, little effort has been made to evaluate the accuracy of AST mapping algorithms, i.e., the extent to which an algorithm captures the evolution of code. We observe that a program element often has only one best-mapped program element. Based on this observation, we propose a hierarchical approach to automatically compare the similarity of mapped statements and tokens by different algorithms. By performing the comparison, we determine if each of the compared algorithms generates inaccurate mappings for a statement or its tokens. We invite 12 external experts to determine if three commonly used AST mapping algorithms generate accurate mappings for a statement and its tokens for 200 statements. Based on the experts' feedback, we observe that our approach achieves a precision of 0.98-1.00 and a recall of 0.65-0.75. Furthermore, we conduct a large-scale study with a dataset of ten Java projects containing a total of 263,165 file revisions. Our approach determines that GumTree, MTDiff and IJM generate inaccurate mappings for 20%-29%, 25%-36% and 21%-30% of the file revisions, respectively. Our experimental results show that state-of-the-art AST mapping algorithms still need improvements.
