Compositional Vector Space Models for Improved Bug Localization

作者: Shaowei Wang , David Lo , Julia Lawall

DOI: 10.1109/ICSME.2014.39

关键词:

摘要: Software developers and maintainers often need to locate code units responsible for a particular bug. A number of Information Retrieval (IR) techniques have been proposed map natural language bug descriptions the associated units. The vector space model (VSM) with standard tf-idf weighting scheme (VSM natural), has shown outperform nine other state-of-the-art IR techniques. However, there are multiple VSM variants different schemes, their relative performance differs software systems. Based on this observation, we propose compose various variants, modelling composition as an optimization problem. We genetic algorithm (GA) based approach explore possible compositions output heuristically near-optimal composite model. evaluated our against several baselines thousands reports from AspectJ, Eclipse, SWT. On average, composite) improves hit at 5 (Hit@5), mean average precision (MAP), reciprocal rank (MRR) scores by 18.4%, 20.6%, 10.5% respectively. also integrate compositional AmaLgam, which is state-of-art localization technique. resultant named AmaLgam can improve Hit@5, MAP, MRR 8.0%, 14.4% 6.5%

参考文章(49)
M. Di Penta, M. Harman, G. Antoniol, A robust search-based approach to project management in the presence of abandonment, rework, error and uncertainty ieee international software metrics symposium. pp. 172- 183 ,(2004) , 10.1109/METRICS.2004.4
Hinrich Schütze, Christopher D. Manning, Prabhakar Raghavan, Introduction to Information Retrieval ,(2005)
Bogdan Dit, Meghan Revelle, Malcom Gethers, Denys Poshyvanyk, Feature location in source code: a taxonomy and survey Journal of Software: Evolution and Process. ,vol. 25, pp. 53- 95 ,(2013) , 10.1002/SMR.567
Frank Wilcoxon, Individual Comparisons by Ranking Methods Springer Series in Statistics. ,vol. 1, pp. 196- 202 ,(1992) , 10.1007/978-1-4612-4380-9_16
Shaowei Wang, David Lo, Zhenchang Xing, Lingxiao Jiang, Concern Localization using Information Retrieval: An Empirical Study on Linux Kernel working conference on reverse engineering. pp. 92- 96 ,(2011) , 10.1109/WCRE.2011.72
Lucia, David Lo, Xin Xia, Fusion fault localizers automated software engineering. pp. 127- 138 ,(2014) , 10.1145/2642937.2642983
Liang Gong, David Lo, Lingxiao Jiang, Hongyu Zhang, Interactive fault localization leveraging simple user feedback international conference on software maintenance. pp. 67- 76 ,(2012) , 10.1109/ICSM.2012.6405255