Predicting the severity of a reported bug

作者: Ahmed Lamkanfi , Serge Demeyer , Emanuel Giger , Bart Goethals

DOI: 10.1109/MSR.2010.5463284

关键词: StatisticsTraining setPrecision and recallGnomeSoftware debuggingData miningSoftware bugComputer scienceOpen source softwarePublic domain softwareEclipse

摘要: The severity of a reported bug is critical factor in deciding how soon it needs to be fixed. Unfortunately, while clear guidelines exist on assign the bug, remains an inherent manual process left person reporting bug. In this paper we investigate whether can accurately predict by analyzing its textual description using text mining algorithms. Based three cases drawn from open-source community (Mozilla, Eclipse and GNOME), conclude that given training set sufficient size (approximately 500 reports per severity), possible with reasonable accuracy (both precision recall vary between 0.65–0.75 Mozilla Eclipse; 0.70–0.85 case GNOME).

参考文章(17)
Davor Cubranic, Gail C. Murphy, Automatic bug triage using text categorization. software engineering and knowledge engineering. pp. 92- 97 ,(2004)
John Arthur Swets, David Marvin Green, Signal Detection Theory and Psychophysics ,(1974)
John Anvik, Lyndon Hiew, Gail C. Murphy, Who should fix this bug? Proceeding of the 28th international conference on Software engineering - ICSE '06. pp. 361- 370 ,(2006) , 10.1145/1134285.1134336
Gaeul Jeong, Sunghun Kim, Thomas Zimmermann, Improving bug triage with bug tossing graphs foundations of software engineering. pp. 111- 120 ,(2009) , 10.1145/1595696.1595715
M.F. Porter, An algorithm for suffix stripping Program: Electronic Library and Information Systems. ,vol. 40, pp. 313- 316 ,(1997) , 10.1108/EB046814
A.J. Ko, B.A. Myers, Duen Horng Chau, A Linguistic Analysis of How People Describe Software Problems symposium on visual languages and human-centric computing. pp. 127- 134 ,(2006) , 10.1109/VLHCC.2006.3