Automatic Fine-Grained Issue Report Reclassification

作者: Pavneet Singh Kochhar , Ferdian Thung , David Lo

DOI: 10.1109/ICECCS.2014.25

关键词:

摘要: Issue tracking systems are valuable resources during software maintenance activities. These contain different categories of issue reports such as bug, request for improvement (RFE), documentation, refactoring, task etc. While logging into a system, reporters can indicate the category reports. Herzig et al. Recently reported that more than 40% given wrong in systems. Among marked bugs, 30% them not bug The misclassification adversely affects developers they then need to manually identify various To address this problem, paper we propose an automated technique reclassifies report appropriate category. Our approach extracts feature values from and predicts if needs be reclassified its We have evaluated our reclassify 7,000 HTTP Client, Jackrabbit, Lucene-Java, Rhino, Tomcat 5 1 out 13 categories. experiments show achieve weighted precision, recall, F1 (F-measure) score ranges 0.58-0.71, 0.61-0.72, 0.57-0.71 respectively. In terms F1, which is harmonic mean precision substantially outperform several baselines by 28.88%-416.66%.

参考文章(38)
David Lowe, David S. Broomhead, Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks Complex Systems. ,vol. 2, pp. 321- 355 ,(1988)
Davor Cubranic, Gail C. Murphy, Automatic bug triage using text categorization. software engineering and knowledge engineering. pp. 92- 97 ,(2004)
Hinrich Schütze, Christopher D. Manning, Prabhakar Raghavan, Introduction to Information Retrieval ,(2005)
Kamal Nigam, Andrew McCallum, A comparison of event models for naive bayes text classification national conference on artificial intelligence. pp. 41- 48 ,(1998)
Peter Willett, Karen Sparck Jones, Readings in information retrieval Morgan Kaufmann Publishers Inc.. ,(1997)
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
LiGuo Huang, Vincent Ng, Isaac Persing, Ruili Geng, Xu Bai, Jeff Tian, AutoODC: Automated generation of Orthogonal Defect Classifications automated software engineering. ,vol. 22, pp. 412- 415 ,(2011) , 10.1109/ASE.2011.6100086
Anita Prinzie, Dirk Van den Poel, Random Forests for multiclass classification: Random MultiNomial Logit Expert Systems With Applications. ,vol. 34, pp. 1721- 1732 ,(2008) , 10.1016/J.ESWA.2007.01.029
John Anvik, Lyndon Hiew, Gail C. Murphy, Coping with an open bug repository eclipse technology exchange. pp. 35- 39 ,(2005) , 10.1145/1117696.1117704
Yuan Tian, David Lo, Chengnian Sun, Information Retrieval Based Nearest Neighbor Classification for Fine-Grained Bug Severity Prediction working conference on reverse engineering. pp. 215- 224 ,(2012) , 10.1109/WCRE.2012.31