Bug Severity Prediction Using a Hierarchical One-vs.-Remainder Approach

作者: Nonso Nnamoko , Luis Adrián Cabrera-Diego , Daniel Campbell , Yannis Korkontzelos

DOI: 10.1007/978-3-030-23281-8_20

关键词:

摘要: Assigning severity level to reported bugs is a critical part of software maintenance ensure an efficient resolution process. In many bug trackers, e.g. Bugzilla, this time consuming process, because reporters must manually assign one seven levels each bug. addition, some types may be more often than others, leading disproportionate distribution labels. Machine learning techniques can used predict the label newly automatically. However, from imbalanced data in multi-class task remains major difficulties for machine classifiers. paper, we propose hierarchical classification approach that exploits class imbalance training data, reduce bias. Specifically, designed tree consists multiple binary classifiers organised hierarchically, such instances most dominant are trained against remaining classes but not next tree. We FastText classifier test and compare between standard approaches. Based on 93,051 reports 38 Eclipse open-source products, was shown perform relatively well with \(65\%\) Micro F-Score \(45\%\) Macro F-Score.

参考文章(17)
Nivir Kanti-Singha Roy, Bruno Rossi, Towards an Improvement of Bug Severity Classification software engineering and advanced applications. pp. 269- 276 ,(2014) , 10.1109/SEAA.2014.51
Daniel Ramage, David Hall, Ramesh Nallapati, Christopher D. Manning, Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora empirical methods in natural language processing. pp. 248- 256 ,(2009) , 10.3115/1699510.1699543
K. K. Chaturvedi, V. B. Singh, Determining Bug severity using machine learning techniques 2012 CSI Sixth International Conference on Software Engineering (CONSEG). pp. 1- 6 ,(2012) , 10.1109/CONSEG.2012.6349519
Yuan Tian, David Lo, Chengnian Sun, Information Retrieval Based Nearest Neighbor Classification for Fine-Grained Bug Severity Prediction working conference on reverse engineering. pp. 215- 224 ,(2012) , 10.1109/WCRE.2012.31
Cheng-Zen Yang, Chun-Chi Hou, Wei-Chen Kao, Ing-Xiang Chen, An Empirical Study on Improving Severity Prediction of Defect Reports Using Feature Selection asia-pacific software engineering conference. ,vol. 1, pp. 240- 249 ,(2012) , 10.1109/APSEC.2012.144
Michael Gegick, Pete Rotella, Tao Xie, Identifying security bug reports via text mining: An industrial case study 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010). pp. 11- 20 ,(2010) , 10.1109/MSR.2010.5463340
Ahmed Lamkanfi, Serge Demeyer, Emanuel Giger, Bart Goethals, Predicting the severity of a reported bug 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010). pp. 1- 10 ,(2010) , 10.1109/MSR.2010.5463284
W. Philip Kegelmeyer, Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, SMOTE: synthetic minority over-sampling technique Journal of Artificial Intelligence Research. ,vol. 16, pp. 321- 357 ,(2002) , 10.1613/JAIR.953
Ahmed Lamkanfi, Serge Demeyer, Quinten David Soetens, Tim Verdonck, Comparing Mining Algorithms for Predicting the Severity of a Reported Bug conference on software maintenance and reengineering. pp. 249- 258 ,(2011) , 10.1109/CSMR.2011.31
Tim Menzies, Andrian Marcus, Automated severity assessment of software defect reports international conference on software maintenance. pp. 346- 355 ,(2008) , 10.1109/ICSM.2008.4658083