Query Aspect Based Term Weighting Regularization in Information Retrieval

作者: Wei Zheng , Hui Fang

DOI: 10.1007/978-3-642-12275-0_31

关键词:

摘要: Traditional retrieval models assume that query terms are independent and rank documents primarily based on various term weighting strategies including TF-IDF document length normalization. However, related, groups of semantically related may form aspects. Intuitively, the relations among could be utilized to identify hidden aspects promote ranking covering more Despite its importance, use semantic for regularization has been under-explored in information retrieval. In this paper, we study incorporation into existing focus addressing challenge, i.e., how regularize weights different improve performance. Specifically, first develop a general strategy can systematically integrate function functions, then propose two specific functions guidance provided by constraint analysis. Experiments eight standard TREC data sets show proposed methods effective accuracy.

参考文章(24)
Amit Singhal, Chris Buckley, Manclar Mitra, Pivoted document length normalization international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 21- 29 ,(1996) , 10.1145/3130348.3130365
Michael Bendersky, W. Bruce Croft, Discovering key concepts in verbose queries Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '08. pp. 491- 498 ,(2008) , 10.1145/1390334.1390419
Matthew Lease, None, An improved markov random field model for supporting verbose queries international acm sigir conference on research and development in information retrieval. pp. 476- 483 ,(2009) , 10.1145/1571941.1572023
Chris Buckley, Why current IR engines fail Proceedings of the 27th annual international conference on Research and development in information retrieval - SIGIR '04. pp. 584- 585 ,(2004) , 10.1145/1008992.1009132
Donald Metzler, W. Bruce Croft, A Markov random field model for term dependencies international acm sigir conference on research and development in information retrieval. pp. 472- 479 ,(2005) , 10.1145/1076034.1076115
Jay M. Ponte, W. Bruce Croft, A language modeling approach to information retrieval international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 275- 281 ,(1998) , 10.1145/3130348.3130368
C.J. VAN RIJSBERGEN, A THEORETICAL BASIS FOR THE USE OF CO‐OCCURRENCE DATA IN INFORMATION RETRIEVAL Journal of Documentation. ,vol. 33, pp. 106- 119 ,(1977) , 10.1108/EB026637
Shuang Liu, Fang Liu, Clement Yu, Weiyi Meng, An effective approach to document retrieval via utilizing WordNet and recognizing phrases Proceedings of the 27th annual international conference on Research and development in information retrieval - SIGIR '04. pp. 266- 272 ,(2004) , 10.1145/1008992.1009039
Hui Fang, Tao Tao, ChengXiang Zhai, A formal study of information retrieval heuristics Proceedings of the 27th annual international conference on Research and development in information retrieval - SIGIR '04. pp. 49- 56 ,(2004) , 10.1145/1008992.1009004
Chengxiang Zhai, John Lafferty, A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 334- 342 ,(2001) , 10.1145/3130348.3130377