2020
DOI: 10.1007/978-3-030-49435-3_30

Mutation Operators for Large Scale Data Processing Programs in Spark

Abstract: This paper proposes a mutation testing approach for big data processing programs that follow a data flow model, such as those implemented on top of Apache Spark. Mutation testing is a fault-based technique that simulates faults by modifying programs to create faulty versions called mutants. Mutants are created by operators that simulate specific, well-identified faults. A testing process must be able to signal faults within mutants and thereby avoid having ill behaviours within a pro…
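To make the idea in the abstract concrete, the following is a minimal sketch in plain Python, not Spark code and not the paper's implementation. It models a data flow program as a list of transformations, applies a hypothetical mutation operator (`delete_step`, which removes one transformation) to obtain a mutant, and checks whether a test input distinguishes the mutant from the original. All names here (`keep_even`, `double`, `run`, `delete_step`, `is_killed`) are assumptions introduced for illustration.

```python
from typing import Callable, List

# A transformation maps a dataset to a dataset (lists stand in for distributed data).
Step = Callable[[List[int]], List[int]]


def keep_even(data: List[int]) -> List[int]:
    """Filter-like transformation: keep only even numbers."""
    return [x for x in data if x % 2 == 0]


def double(data: List[int]) -> List[int]:
    """Map-like transformation: double every element."""
    return [x * 2 for x in data]


def run(pipeline: List[Step], data: List[int]) -> List[int]:
    """Apply each transformation of the pipeline in order."""
    for step in pipeline:
        data = step(data)
    return data


def delete_step(pipeline: List[Step], index: int) -> List[Step]:
    """Hypothetical mutation operator: return a copy without one transformation."""
    return pipeline[:index] + pipeline[index + 1:]


def is_killed(original: List[Step], mutant: List[Step], test_input: List[int]) -> bool:
    """A test input kills a mutant when the two programs produce different outputs."""
    return run(original, test_input) != run(mutant, test_input)


original = [keep_even, double]
mutant = delete_step(original, 0)  # the filter is removed, only `double` remains

print(is_killed(original, mutant, [2, 4]))     # False: all-even input hides the fault
print(is_killed(original, mutant, [1, 2, 3]))  # True: odd values expose the missing filter
```

The first input contains only even numbers, so the deleted filter makes no observable difference and the mutant survives. The second input includes odd values, so the outputs diverge and the mutant is killed. This is the sense in which mutants help evaluate how well a test set signals faults.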

Cited by 3 publications (32 citation statements)
References 15 publications
“…A natural extension to this work would be to instantiate the tool for other systems of the data flow family (DryadLINQ, Apache Beam, Apache Flink). This can be done by adapting TRANSMUT-Spark's front and back ends so that a program originally written in any of them can be tested with the mutation testing proposed in [21].…”
Section: Discussion
Type: mentioning
confidence: 99%
“…We first applied the model to formalize the mutation operators presented in [21], where we explored the application of mutation testing in Spark programs, and in the tool TRANSMUT-Spark 4 that we developed to automate this process.…”
Section: Applications Of the Model
Type: mentioning
confidence: 99%
“…The abstract and formal concepts provided by the model make it suitable for the automation of software development processes, such as those done by IDE tools. Consequently, we first applied the model to formalize the mutation operators presented in [13], where we explored the application of mutation testing in Spark programs, and in the tool TRANSMUT-Spark 1 [25] that we developed to automate this process. Mutation testing is a fault-based technique that simulates faults to design and evaluate test sets [26].…”
Section: Applications Of the Model
Type: mentioning
confidence: 99%
“…We have used the model to define mutation operators that can be instantiated for different systems. In particular, specifications in our model have been used as an intermediate representation of programs in a mutation testing tool of Apache Spark programs [13].…”
Section: Introduction
Type: mentioning
confidence: 99%