Failure is a four-letter word

作者: Andreas Zeller , Thomas Zimmermann , Christian Bird

DOI: 10.1145/2020390.2020395

关键词:

摘要: Background: The past years have seen a surge of techniques predicting failure-prone locations based on more or less complex metrics. Few these metrics are actionable, though.Aims: This paper explores simple, easy-to-implement method to predict and avoid failures in software systems. IROP links elementary source code features known lightweight, fashion.Method: We sampled the Eclipse data set mapping defects files three releases. used logistic regression associate programmer actions with defects, tested predictive power resulting classifier terms precision recall, isolated most defect-prone actions. also collected initial feedback possible remedies.Results: In our sample set, correctly predicted up 74% modules, which is par elaborate predictors available. four easy-to-remember recommendations, telling programmers precisely what do errors. Initial from developers suggests that recommendations straightforward follow practice.Conclusions: With abundance development data, even simplest methods can produce "actionable" results.

参考文章(14)
William R. Shadish, Donald Thomas Campbell, Thomas D. Cook, Experimental and Quasi-Experimental Designs for Generalized Causal Inference ,(2001)
Darrell Huff, How to Lie with Statistics ,(1954)
Minghui Zhou, Audris Mockus, Developer fluency: achieving true mastery in software projects foundations of software engineering. pp. 137- 146 ,(2010) , 10.1145/1882291.1882313
Andrew J. Ko, Robert DeLine, Gina Venolia, Information Needs in Collocated Software Development Teams international conference on software engineering. pp. 344- 353 ,(2007) , 10.1109/ICSE.2007.45
Abram Hindle, Daniel M. German, Ric Holt, What do large commits tell us? Proceedings of the 2008 international workshop on Mining software repositories - MSR '08. pp. 99- 108 ,(2008) , 10.1145/1370750.1370773
Yoav Benjamini, Yosef Hochberg, Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 57, pp. 289- 300 ,(1995) , 10.1111/J.2517-6161.1995.TB02031.X
Jo E. Hannay, Dag I.K. Sjoberg, Tore Dyba, A Systematic Review of Theory Use in Software Engineering Experiments IEEE Transactions on Software Engineering. ,vol. 33, pp. 87- 107 ,(2007) , 10.1109/TSE.2007.12
Thomas Zimmermann, Rahul Premraj, Andreas Zeller, Predicting Defects for Eclipse model driven engineering languages and systems. pp. 76- ,(2007) , 10.1109/PROMISE.2007.10
Nicolas Bettenburg, Sascha Just, Adrian Schröter, Cathrin Weiß, Rahul Premraj, Thomas Zimmermann, Quality of bug reports in Eclipse Proceedings of the 2007 OOPSLA workshop on eclipse technology eXchange - eclipse '07. pp. 21- 25 ,(2007) , 10.1145/1328279.1328284
J. Greenwald, T. Menzies, A. Frank, Data Mining Static Code Attributes to Learn Defect Predictors IEEE Transactions on Software Engineering. ,vol. 33, pp. 2- 13 ,(2007) , 10.1109/TSE.2007.10