“…Dogša and Batic [10] | The effectiveness of test-driven development: an industrial case study
Fucci and Turhan [16] | A Replicated Experiment on the Effectiveness of Test-first Development
Thomson et al [44] | What Makes Testing Work: Nine Case Studies of Software Development Teams
Romano et al [35] | Findings from a multi-method study on test-driven development
Buchan et al [7] | Causal Factors, Benefits and Challenges of Test-Driven Development: Practitioner Perceptions
Scanniello et al [38] | Students' and Professionals' Perceptions of Test-driven Development: A Focus Group Study
Beller et al [4] | Developer Testing in The IDE: Patterns, Beliefs, And Behavior
Bannerman and Martin [1] | A multiple comparative study of test-with development product changes and their effects on team speed and product quality

Table 3: The primary studies collected in the third step

Authors | Title
Karac et al [23] | A Controlled Experiment with Novice Developers on the Impact of Task Description Granularity on Software Quality in Test-Driven Development
Tosun et al [46] | Investigating the Impact of Development Task on External Quality in Test-Driven Development: An Industry Experiment
Borle et al [6] | Analyzing the effects of test driven development in GitHub
Fucci et al [15] | A longitudinal cohort study on the retainment of test-driven development
Kazerouni et al [24] | Assessing Incremental Testing Practices and Their Impact on Project Outcomes
Santos et al [37] | Improving Development Practices through Experimentation: an Industrial TDD Case
Tosun et al [45] | On the Effectiveness of Unit Tests in Test-driven Development
Santos et al [36] | Does the Performance of TDD Hold Across Software Companies and Premises? A Group of Industrial Experiments on TDD
Romano et al [34] | An Empirical Assessment on Affective Reactions of Novice Developers When Applying Test-Driven Development
Sundelin et al [43] | Test-Driving FinTech Product Development: An Experience Report

In terms of test quality, studies have assessed tests by measuring mutation scores (i.e., the bug detection ability of the tests) and code coverage (i.e., the degree to which the source code of a program is executed when a test suite runs). For example, Tosun et al [45] conducted an experiment with 24 professionals and found that unit-test cases developed in TDD have a higher mutation score and branch coverage, but lower method coverage, than those developed in ITL.…”
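
The two test-quality metrics mentioned above can be illustrated with a minimal sketch. It assumes the common simplified definitions: mutation score as the fraction of generated mutants that the test suite detects ("kills"), and coverage as the fraction of coverable items (branches or methods) executed by the suite. The function names and the numbers in the usage lines are hypothetical illustrations, not data or tooling from any of the cited studies, and real tools may refine these definitions (for example, by excluding equivalent mutants).

```python
def mutation_score(killed: int, total: int) -> float:
    """Fraction of mutants detected (killed) by the test suite.

    Simplified definition (assumption): all generated mutants count,
    with no special handling of equivalent mutants.
    """
    if total <= 0:
        raise ValueError("total must be a positive number of mutants")
    if not 0 <= killed <= total:
        raise ValueError("killed must be between 0 and total")
    return killed / total


def coverage(executed: set[str], coverable: set[str]) -> float:
    """Fraction of coverable items (e.g. branches or methods) that were executed."""
    if not coverable:
        raise ValueError("coverable must not be empty")
    # Only count executed items that are actually part of the coverable set.
    return len(executed & coverable) / len(coverable)


# Hypothetical numbers, for illustration only.
score = mutation_score(killed=18, total=24)
method_cov = coverage({"add", "sub", "mul"}, {"add", "sub", "mul", "div"})
print(f"mutation score: {score:.2f}, method coverage: {method_cov:.2f}")
```

Under these definitions a suite can score high on one metric and low on another, since executing a method does not guarantee that a fault injected into it is detected.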