Predicting Effectiveness of IR-Based Bug Localization Techniques

作者: Tien-Duy B. Le , Ferdian Thung , David Lo

DOI: 10.1109/ISSRE.2014.39

关键词:

摘要: Recently, many information retrieval (IR) based bug localization approaches have been proposed in the literature. These use techniques to process a textual report and collection of source code files find buggy files. They output ranked list sorted by their likelihood contain bug. Recent can achieve reasonable accuracy, however, even state-of-the-art tool outputs lists where appear very low lists. This potentially causes developers distrust tools. Parnin Orso recently conduct user study highlight that do not an automated debugging useful if they root cause early list. To address this problem, we build oracle automatically predict whether produced IR-based is likely be effective or not. We consider file appears top-N position If unlikely effective, need waste time checking recommended one one. In such cases, it better for traditional methods request further localize bugs. oracle, our approach extracts features divided into four categories: score features, topic model metadata features. separate prediction each category, combine them create composite which used as oracle. name APRILE, stands Automated Prediction Bug Localization's Effectiveness. evaluated APRILE effectiveness three IR tools on more than thousands reports from AspectJ, Eclipse, SWT. average precision, recall, F-measure at least 70.36%, 66.94%, 68.03%, respectively. Furthermore, outperforms baseline 84.48%, 17.74%, 31.56% SWT reports,

参考文章(38)
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
James F. Bowring, James M. Rehg, Mary Jean Harrold, Active learning for automatic classification of software behavior international symposium on software testing and analysis. ,vol. 29, pp. 195- 205 ,(2004) , 10.1145/1007512.1007539
Ripon K Saha, Matthew Lease, Sarfraz Khurshid, Dewayne E Perry, None, Improving bug localization using structured information retrieval automated software engineering. pp. 345- 355 ,(2013) , 10.1109/ASE.2013.6693093
Chakkrit Tantithamthavorn, Akinori Ihara, Ken-Ichi Matsumoto, Using Co-change Histories to Improve Bug Localization Performance 2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing. pp. 543- 548 ,(2013) , 10.1109/SNPD.2013.92
Emad Shihab, Akinori Ihara, Yasutaka Kamei, Walid M. Ibrahim, Masao Ohira, Bram Adams, Ahmed E. Hassan, Ken-ichi Matsumoto, Predicting Re-opened Bugs: A Case Study on the Eclipse Project working conference on reverse engineering. pp. 249- 258 ,(2010) , 10.1109/WCRE.2010.36
W. Eric Wong, Vidroha Debroy, Ruizhi Gao, Yihao Li, The DStar Method for Effective Software Fault Localization IEEE Transactions on Reliability. ,vol. 63, pp. 290- 308 ,(2014) , 10.1109/TR.2013.2285319
Stacy K. Lukins, Nicholas A. Kraft, Letha H. Etzkorn, Bug localization using latent Dirichlet allocation Information & Software Technology. ,vol. 52, pp. 972- 990 ,(2010) , 10.1016/J.INFSOF.2010.04.002
Yuan Tian, David Lo, Chengnian Sun, Information Retrieval Based Nearest Neighbor Classification for Fine-Grained Bug Severity Prediction working conference on reverse engineering. pp. 215- 224 ,(2012) , 10.1109/WCRE.2012.31
Holger Cleve, Andreas Zeller, Locating causes of program failures international conference on software engineering. pp. 342- 351 ,(2005) , 10.1145/1062455.1062522
Emad Shihab, Akinori Ihara, Yasutaka Kamei, Walid M. Ibrahim, Masao Ohira, Bram Adams, Ahmed E. Hassan, Ken-ichi Matsumoto, Studying re-opened bugs in open source software Empirical Software Engineering. ,vol. 18, pp. 1005- 1042 ,(2013) , 10.1007/S10664-012-9228-6