Selecting the n-top retrieval result lists for an effective data fusion

作者: Antonio Juárez-González , Manuel Montes-y-Gómez , Luis Villaseñor-Pineda , David Pinto-Avendaño , Manuel Pérez-Coutiño

DOI: 10.1007/978-3-642-12116-6_49

关键词:

摘要: Although the application of data fusion in information retrieval has yielded good results majority cases, it been noticed that its achievement is dependent on quality input result lists. In order to tackle this problem, paper we explore combination only n-top lists as an alternative all available data. particular, describe a heuristic measure based redundancy and ranking evaluate each list, and, consequently, select presumably n-best per query. Preliminary four IR test collections, containing total 266 queries, employing three different DF methods are encouraging. They indicate proposed approach could significantly outperform achieved by lists, showing improvements mean average precision 10.7%, 3.7% 18.8% when was used along with Maximum RSV, CombMNZ Fuzzy Borda methods.

参考文章(25)
Christopher C Vogt, Garrison W Cottrell, Richard K Belew, Brian T Bartell, None, Using relevance to train a linear mixture of experts text retrieval conference. pp. 503- 515 ,(1996)
John D. Lafferty, Guy Lebanon, Cranking: Combining Rankings Using Conditional Probability Models on Permutations international conference on machine learning. pp. 363- 370 ,(2002)
r;ribeiro-neto bueza-yates (b), Modern Information Retrieval ,(1999)
John Dunnion, Fergus Toolan, David Lillis, Rem Collier, ProbFuse: a probabilistic approach to data fusion international acm sigir conference on research and development in information retrieval. pp. 139- 146 ,(2006) , 10.1145/1148170.1148197
Christopher C Vogt, Garrison W Cottrell, None, Predicting the performance of linearly combined IR systems international acm sigir conference on research and development in information retrieval. pp. 190- 196 ,(1998) , 10.1145/290941.290991
Shengli Wu, Sally McClean, Performance prediction of data fusion for information retrieval Information Processing and Management. ,vol. 42, pp. 899- 915 ,(2006) , 10.1016/J.IPM.2005.08.004
Mark D. Smucker, James Allan, Ben Carterette, Agreement among statistical significance tests for information retrieval evaluation at varying sample sizes international acm sigir conference on research and development in information retrieval. pp. 630- 631 ,(2009) , 10.1145/1571941.1572050
Richard K. Belew, Brian T. Bartell, Garrison W. Cottrell, Automatic combination of multiple ranked retrieval systems international acm sigir conference on research and development in information retrieval. pp. 173- 181 ,(1994) , 10.5555/188490.188554
D. Frank Hsu, Isak Taksa, Comparing Rank and Score Combination Methods for Data Fusion in Information Retrieval Information Retrieval. ,vol. 8, pp. 449- 480 ,(2005) , 10.1007/S10791-005-6994-4