Are mutation scores correlated with real fault detection?: a large scale empirical study on the relationship between mutants and real faults

作者: Mike Papadakis , Donghwan Shin , Shin Yoo , Doo-Hwan Bae

DOI: 10.1145/3180155.3180183

关键词: Fault detection and isolationTest suiteStatisticsVariablesMathematicsContrast (statistics)Mutation (genetic algorithm)Mutation testing

摘要: Empirical validation of software testing studies is increasingly relying on mutants. This practice motivated by the strong correlation between mutant scores and real fault detection that reported in literature. In contrast, our study shows correlations are results confounding effects test suite size. particular, we investigate relation two independent variables, mutation score size, with one dependent variable (real) faults. We use data sets, CoreBench De-fects4J, large C Java programs faults provide evidence all weak when controlling for also found both variables significantly influence one, better fits, but overall relative low prediction power. By measuring capability top ranked, according to score, suites (opposed randomly selected same size), achieving higher improves detection. Taken together, suggest mutants good guidance improving suites, their weak.

参考文章(46)
Paul Ammann, Jeff Offutt, Introduction to Software Testing Cambridge University Press. ,(2008) , 10.1017/CBO9780511809163
Jeff Offutt, Paul Ammann, Introduction to Software Testing ,(2016)
David Schuler, Andreas Zeller, Covering and Uncovering Equivalent Mutants Software Testing, Verification and Reliability. ,vol. 23, pp. 353- 374 ,(2013) , 10.1002/STVR.1473
Cristian Cadar, Daniel Dunbar, Dawson Engler, KLEE: unassisted and automatic generation of high-coverage tests for complex systems programs operating systems design and implementation. pp. 209- 224 ,(2008) , 10.5555/1855741.1855756
Marcel Böhme, Abhik Roychoudhury, CoREBench: studying complexity of regression errors Proceedings of the 2014 International Symposium on Software Testing and Analysis - ISSTA 2014. pp. 105- 115 ,(2014) , 10.1145/2610384.2628058
A. Jefferson Offutt, Jie Pan, Kanupriya Tewary, Tong Zhang, An experimental evaluation of data flow and mutation testing Software - Practice and Experience. ,vol. 26, pp. 165- 176 ,(1996) , 10.1002/(SICI)1097-024X(199602)26:2<165::AID-SPE5>3.0.CO;2-K
Carlos Pacheco, Michael D. Ernst, Randoop Companion to the 22nd ACM SIGPLAN conference on Object oriented programming systems and applications companion - OOPSLA '07. pp. 815- 816 ,(2007) , 10.1145/1297846.1297902
Hyunsook Do, Sebastian Elbaum, Gregg Rothermel, Supporting Controlled Experimentation with Testing Techniques: An Infrastructure and its Potential Impact Empirical Software Engineering. ,vol. 10, pp. 405- 435 ,(2005) , 10.1007/S10664-005-3861-2
Gordon Fraser, Andrea Arcuri, EvoSuite: automatic test suite generation for object-oriented software foundations of software engineering. pp. 416- 419 ,(2011) , 10.1145/2025113.2025179
Akbar Siami Namin, James H. Andrews, The influence of size and coverage on test suite effectiveness Proceedings of the eighteenth international symposium on Software testing and analysis - ISSTA '09. pp. 57- 68 ,(2009) , 10.1145/1572272.1572280