Frequentist and Bayesian Approach to Information Retrieval

作者: Giambattista Amati

DOI: 10.1007/11735106_3

关键词:

摘要: We introduce the hypergeometric models KL, DLH and DLLH using DFR approach, we compare these to other relevant of IR. The are based on probability observing two probabilities: relative within-document term frequency entire collection frequency. Hypergeometric parameter-free Experiments show that have an excellent performance with small very large collections. provide their foundations from same IR space language modelling (LM). finally discuss difference between LM. Briefly, is a frequentist (Type I), or combinatorial whilst use Bayesian II) approach for mixing probabilities, being thus inherently parametric in its nature.

参考文章(28)
Iadh Ounis, Vassilis Plachouras, Ben He, University of Glasgow at TREC 2004: Experiments in Web, Robust, and Terabyte Tracks with Terrier. text retrieval conference. ,(2004)
Gianni Amati, Claudio Carpineto, Giovanni Romano, Fondazione Ugo Bordoni, None, FUB at TREC-10 Web track: A probabilistic framework for topic relevance term weighting text retrieval conference. pp. 182- 191 ,(2001)
Gerard Salton, The SMART retrieval system ,(1971)
Gianni Amati, Claudio Carpineto, Giovanni Romano, None, Fondazione Ugo Bordoni at TREC 2004. text retrieval conference. ,(2004)
Stephen Paul Harter, A probabilistic approach to automatic keyword indexing Journal of the Association for Information Science and Technology. ,(1974)
F. Jelinek, Interpolated estimation of Markov source parameters from sparse data Proc. Workshop on Pattern Recognition in Practice, 1980. pp. 381- 397 ,(1980)
Giambattista Amati, Probability models for information retrieval based on divergence from randomness British Library, British Thesis Service. ,(2003)
Iadh Ounis, Gianni Amati, Vassilis Plachouras, Ben He, Craig Macdonald, Douglas Johnson, Terrier information retrieval platform european conference on information retrieval. pp. 517- 519 ,(2005) , 10.1007/978-3-540-31865-1_37
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)
Chengxiang Zhai, John Lafferty, Model-based feedback in the language modeling approach to information retrieval Proceedings of the tenth international conference on Information and knowledge management - CIKM'01. pp. 403- 410 ,(2001) , 10.1145/502585.502654