Query-sensitive similarity measures for information retrieval

Authors: Anastasios Tombros, C.J. van Rijsbergen

DOI: 10.1007/S10115-003-0115-8

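Abstract: The application of document clustering to information retrieval has been motivated by the potential effectiveness gains postulated by the cluster hypothesis. The hypothesis states that relevant documents tend to be highly similar to each other and therefore tend to appear in the same clusters. In this paper we propose an axiomatic view of the hypothesis by suggesting that documents relevant to the same query (co-relevant documents) display an inherent similarity to each other that is dictated by the query itself. Because of this inherent similarity, the cluster hypothesis should be valid for any document collection. Our research describes an attempt to devise means by which this similarity can be detected. We propose the use of query-sensitive similarity measures that bias interdocument relationships toward pairs of documents that jointly possess attributes expressed in a query. We experimentally tested three query-sensitive measures against conventional ones that do not take the query into account, and we also examined the comparative effectiveness of the three query-sensitive measures. We calculated interdocument relationships for varying numbers of top-ranked documents for six document collections. Our results show a consistent and significant increase in the number of relevant documents that become nearest neighbors of any given relevant document when query-sensitive measures are used. These results suggest that the effectiveness of a cluster-based system has the potential to increase through the use of query-sensitive similarity measures.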
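
As an illustration only, the idea of biasing interdocument similarity toward a query might be sketched in Python as below. The abstract does not define the three measures that were tested, so everything here is an assumption: documents and queries are taken to be sparse term-weight dictionaries, the conventional measure is taken to be cosine similarity, and the query-sensitive score is taken to be that cosine multiplied by how well the terms shared by the two documents match the query. The function names (`cosine`, `common_terms`, `query_sensitive_similarity`) and the toy data are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def common_terms(d1, d2):
    """Terms present in both documents, weighted by the mean of the two weights."""
    return {t: (d1[t] + d2[t]) / 2 for t in d1 if t in d2}


def query_sensitive_similarity(d1, d2, query):
    """Assumed form: conventional cosine scaled by the query match of the shared terms."""
    return cosine(d1, d2) * cosine(common_terms(d1, d2), query)


# Toy data (hypothetical): b and c are equally similar to a under plain cosine,
# but only b shares both query terms with a.
query = {"cluster": 1, "retrieval": 1}
a = {"cluster": 1, "retrieval": 1, "music": 1}
b = {"cluster": 1, "retrieval": 1, "art": 1}
c = {"cluster": 1, "music": 1, "jazz": 1}

plain_ab = cosine(a, b)
plain_ac = cosine(a, c)
qs_ab = query_sensitive_similarity(a, b, query)
qs_ac = query_sensitive_similarity(a, c, query)
```

In this toy setup the conventional measure cannot tell the pairs apart, while the assumed query-sensitive score ranks `b` closer to `a` than `c`, which is the kind of bias toward jointly held query attributes that the abstract describes.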

References (32)
P. Willett. Query-specific automatic document classification. International Forum on Information and Documentation, vol. 10, pp. 28-32 (1985).
D. K. Harman. The First Text REtrieval Conference (TREC-1). Special Publication (NIST SP) 500-207 (1993). DOI: 10.6028/NIST.SP.500-207
Nelson Goodman. Problems and Projects (1979).
Jonathan Furner-Hines, David Ellis, Peter Willett. Measuring the degree of similarity between objects in text retrieval systems. Perspectives in Information Management, vol. 3 (1993).
Ellen M. Voorhees. The Effectiveness and Efficiency of Agglomerative Hierarchic Clustering in Document Retrieval (1985).
Ji-Rong Wen, Jian-Yun Nie, Hong-Jiang Zhang. Clustering user queries of a search engine. Proceedings of the Tenth International Conference on World Wide Web (WWW '01), pp. 162-168 (2001). DOI: 10.1145/371920.371974
Gerard Salton, Christopher Buckley. Term Weighting Approaches in Automatic Text Retrieval. Information Processing and Management, vol. 24, pp. 323-328 (1988). DOI: 10.1016/0306-4573(88)90021-0
Jinxi Xu, W. Bruce Croft. Query Expansion Using Local and Global Document Analysis. International ACM SIGIR Conference on Research and Development in Information Retrieval, vol. 51, pp. 4-11 (1996). DOI: 10.1145/3130348.3130364