2022 IEEE Conference on Software Testing, Verification and Validation (ICST) 2022
DOI: 10.1109/icst53961.2022.00021
|View full text |Cite
|
Sign up to set email alerts
|

Evaluating Features for Machine Learning Detection of Order- and Non-Order-Dependent Flaky Tests

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 40 publications
0
4
0
Order By: Relevance
“…We retrieved tests from the empirical study of flaky tests across programming languages of Costa et al [51] and from a recent study about pinpointing causes of flakiness by Habchi et al [52]. We also retrieved the flaky tests from iFixFlakies [17] as Test order dependency is a flakiness category that received a large interest in the community [9], [18], [54], [55].…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…We retrieved tests from the empirical study of flaky tests across programming languages of Costa et al [51] and from a recent study about pinpointing causes of flakiness by Habchi et al [52]. We also retrieved the flaky tests from iFixFlakies [17] as Test order dependency is a flakiness category that received a large interest in the community [9], [18], [54], [55].…”
Section: Discussionmentioning
confidence: 99%
“…Others investigated the use of test smells [13] and code metrics [33] for predicting flaky tests. Trying to outperform the performances of existing approaches, others relied on a mix of static and dynamic features, like FlakeFlagger [34] or Flake16 [35]. Fixing flakiness is also an aspect that has recently been investigated.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Many existing flakiness detection approaches rely on information extracted at runtime: FlakeFlagger [15] and Flake16 [16] measure properties such as API usage, file-system access, memory usage, and threading behavior to extract features for training binary classifiers to distinguish flaky from nonflaky tests. Others go a step further and mutate the execution environment to expose flakiness by setting seeds of random number generators [19], switching implementations of methods with non-deterministic specifications [17], or adding noise to the execution environment [20].…”
Section: A Using Instrumentation or Language-specific Artifacts To De...mentioning
confidence: 99%