Liberal relevance criteria of TREC -

作者: Eero Sormunen

DOI: 10.1145/564376.564433

关键词: Scale (chemistry)Computer scienceExperimental researchProcess (engineering)Relevance (information retrieval)Test (assessment)Information retrieval

摘要: Most test collections (like TREC and CLEF) for experimental research in information retrieval apply binary relevance assessments. This paper introduces a four-point scale reports the findings of project which TREC-7 TREC-8 document pools on 38 topics were reassessed. The goal reassessment was to build subcollection experiments highly relevant documents learn about assessment process as well characteristics multigraded corpus.Relevance criteria defined so that distinction made between rich topical (relevant documents) poor (marginally documents). It turned out 50% assessed regarded marginal. corpus lessons learned from are discussed. need develop more elaborated schemes is emphasized.

参考文章(12)
David Hawking, Overview of the TREC-9 Web Track. text retrieval conference. ,(2000)
E. Voorhees, Overview of the Seventh Text REtrieval Conference text retrieval conference. pp. 1- 24 ,(1998)
Michael Gordon, Praveen Pathak, Finding information on the World Wide Web: the retrieval effectiveness of search engines Information Processing and Management. ,vol. 35, pp. 141- 180 ,(1999) , 10.1016/S0306-4573(98)00041-7
Kalervo Järvelin, Jaana Kekäläinen, IR evaluation methods for retrieving highly relevant documents international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 41- 48 ,(2000) , 10.1145/3130348.3130374
Ellen M. Voorhees, Evaluation by highly relevant documents international acm sigir conference on research and development in information retrieval. pp. 74- 82 ,(2001) , 10.1145/383952.383963
Justin Zobel, How reliable are the results of large-scale information retrieval experiments? international acm sigir conference on research and development in information retrieval. pp. 307- 314 ,(1998) , 10.1145/290941.291014
Robert Burgin, Variations in relevance judgments and the evaluation of retrieval performance Information Processing and Management. ,vol. 28, pp. 619- 627 ,(1992) , 10.1016/0306-4573(92)90031-T
David C. Blair, M. E. Maron, An evaluation of retrieval effectiveness for a full-text document-retrieval system Communications of The ACM. ,vol. 28, pp. 289- 299 ,(1985) , 10.1145/3166.3197
Ellen M. Voorhees, Variations in relevance judgments and the measurement of retrieval effectiveness Information Processing & Management. ,vol. 36, pp. 697- 716 ,(2000) , 10.1016/S0306-4573(00)00010-8