An Active Learning Approach to Ecien tly Ranking Retrieval Engines

作者: Lisa A. Torrey , Javed A. Aslam

DOI:

关键词:

摘要: Evaluating retrieval systems, such as those submitted to the annual TREC competition, usually requires a large number of documents be read and judged for relevance query topics. Test collections are far too big exhaustively judged, so only subset is selected form judgment \pool." The selection method that uses produces pools still quite large. Research has indicated it possible rank systems correctly using substantially smaller pools. This paper introduces an active learning algorithm whose goal reach correct rankings smallest judgments. It adds one document pool at time, always trying select with highest information gain. Several variants this described, each improvements on before. Results from experiments included comparison traditional pooling method. best version reliably outperforms method, although its degree improvement varies.

参考文章(4)
Ellen M. Voorhees, Donna Harman, Overview of the Eighth Text REtrieval Conference (TREC-8). text retrieval conference. ,(1999)
Ian Soboroff, Charles Nicholas, Patrick Cahan, Ranking retrieval systems without relevance judgments international acm sigir conference on research and development in information retrieval. pp. 66- 73 ,(2001) , 10.1145/383952.383961
Gordon V. Cormack, Christopher R. Palmer, Charles L. A. Clarke, Efficient construction of large test collections international acm sigir conference on research and development in information retrieval. pp. 282- 289 ,(1998) , 10.1145/290941.291009
Justin Zobel, How reliable are the results of large-scale information retrieval experiments? international acm sigir conference on research and development in information retrieval. pp. 307- 314 ,(1998) , 10.1145/290941.291014