Relation based term weighting regularization

作者: Hao Wu , Hui Fang

DOI: 10.1007/978-3-642-28997-2_10

关键词:

摘要: Traditional retrieval models compute term weights based on only the information related to individual terms such as TF and IDF. However, query are related. Intuitively, these relations could provide useful about importance of a in context other terms. For example, "perl tutorial" specifies that user look for relevant both perl tutorial. Thus, document containing should have higher relevance score than ones with one them. if IDF value "tutorial" is much smaller "perl", existing may assign lower those multiple occurrences "perl". It clear be dependent not collection statistics but also In this work, we study how utilize semantic among regularize weighting. Experiment results over TREC collections show proposed strategy effective improve performance.

参考文章(21)
Claire Cardie, Mandar Mitra, Chris Buckley, Amit Singhal, An analysis of statistical and syntactic phrases RIAO '97 Computer-Assisted Information Searching on Internet. pp. 200- 214 ,(1997)
Ellen M. Voorhees, Overview of the TREC 2005 Robust Retrieval Track text retrieval conference. ,(2005)
Mike Gatford, Micheline Hancock-Beaulieu, Susan Jones, Stephen E. Robertson, Steve Walker, Okapi at TREC text retrieval conference. pp. 109- 123 ,(1994)
Wei Zheng, Hui Fang, Query Aspect Based Term Weighting Regularization in Information Retrieval Lecture Notes in Computer Science. pp. 344- 356 ,(2010) , 10.1007/978-3-642-12275-0_31
Gerard Salton, Automatic text processing: the transformation, analysis, and retrieval of information by computer Addison-Wesley Longman Publishing Co., Inc.. ,(1989)
Marianne Lykke, Birger Larsen, Haakon Lund, Peter Ingwersen, Developing a Test Collection for the Evaluation of Integrated Search Lecture Notes in Computer Science. pp. 627- 630 ,(2010) , 10.1007/978-3-642-12275-0_63
Amit Singhal, Chris Buckley, Manclar Mitra, Pivoted document length normalization international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 21- 29 ,(1996) , 10.1145/3130348.3130365
Brian E. Dearing, Frederick Hartwig, Exploratory Data Analysis ,(1979)
Warren R. Greiff, A theory of term weighting based on exploratory data analysis international acm sigir conference on research and development in information retrieval. pp. 11- 19 ,(1998) , 10.1145/290941.290948
Donald Metzler, W. Bruce Croft, A Markov random field model for term dependencies international acm sigir conference on research and development in information retrieval. pp. 472- 479 ,(2005) , 10.1145/1076034.1076115