On the unreliability of bug severity data

作者: Yuan Tian , Nasir Ali , David Lo , Ahmed E. Hassan

DOI: 10.1007/S10664-015-9409-1

关键词: Machine learningSoftwareData qualityArtificial intelligenceData miningSoftware systemSoftware ProblemReliability (statistics)Computer science

摘要: Severity levels, e.g., critical and minor, of bugs are often used to prioritize development efforts. Prior research efforts have proposed approaches automatically assign the severity label a bug report. All prior verify accuracy their using human-assigned reports data that is stored in software repositories. However, all assume such reliable. Hence perfect automated approach should be able same as repository --- achieving 100% accuracy. Looking at duplicate (i.e., referring problem) from three open-source systems (OpenOffice, Mozilla, Eclipse), we find around 51 % inconsistent labels even though they refer problem. While our results do indicate unreliable labels, believe send warning signals about reliability full including non-duplicate reports). Future explore if findings generalize dataset. Moreover, factor nature data. Given data, classical metrics assess models/learners not for assessing assigning label. Hence, propose new performance models. Our assessment shows current perform well 77-86 agreement with labels.

参考文章(34)
Ahmed Lamkanfi, Serge Demeyer, Emanuel Giger, Bart Goethals, Predicting the severity of a reported bug 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010). pp. 1- 10 ,(2010) , 10.1109/MSR.2010.5463284
Chieh-Yi Lin, Nan-Hsing Chiu, Sun-Jen Huang, Fuzzy Decision Tree Approach for Embedding Risk Assessment Information into Software Cost Estimation Model Journal of Information Science and Engineering. ,vol. 22, pp. 297- 313 ,(2006) , 10.6688/JISE.2006.22.2.5
Yuan Tian, Chengnian Sun, David Lo, Improved Duplicate Bug Report Identification conference on software maintenance and reengineering. pp. 385- 390 ,(2012) , 10.1109/CSMR.2012.48
Feng Zhang, Foutse Khomh, Ying Zou, Ahmed E. Hassan, An Empirical Study on Factors Impacting Bug Fixing Time working conference on reverse engineering. pp. 225- 234 ,(2012) , 10.1109/WCRE.2012.32
K. Strike, K. El Emam, N. Madhavji, Software cost estimation with incomplete data IEEE Transactions on Software Engineering. ,vol. 27, pp. 890- 908 ,(2001) , 10.1109/32.962560
Chengnian Sun, David Lo, Xiaoyin Wang, Jing Jiang, Siau-Cheng Khoo, A discriminative model approach for accurate duplicate bug report retrieval Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - ICSE '10. ,vol. 1, pp. 45- 54 ,(2010) , 10.1145/1806799.1806811
Harold Valdivia Garcia, Emad Shihab, None, Characterizing and predicting blocking bugs in open source projects mining software repositories. pp. 72- 81 ,(2014) , 10.1145/2597073.2597099
Giuliano Antoniol, Kamel Ayari, Massimiliano Di Penta, Foutse Khomh, Yann-Gaël Guéhéneuc, Is it a bug or an enhancement? Proceedings of the 2008 conference of the center for advanced studies on collaborative research meeting of minds - CASCON '08. pp. 23- ,(2008) , 10.1145/1463788.1463819
Ahmed Lamkanfi, Serge Demeyer, Quinten David Soetens, Tim Verdonck, Comparing Mining Algorithms for Predicting the Severity of a Reported Bug conference on software maintenance and reengineering. pp. 249- 258 ,(2011) , 10.1109/CSMR.2011.31
Thorsten Joachims, Text Categorization with Suport Vector Machines: Learning with Many Relevant Features european conference on machine learning. ,vol. 1398, pp. 137- 142 ,(1998) , 10.1007/BFB0026683