Click model-based information retrieval metrics

Authors: Aleksandr Chuklin, Pavel Serdyukov, Maarten de Rijke

DOI: 10.1145/2484028.2484071

Abstract: In recent years many models have been proposed that are aimed at predicting clicks of web search users. In addition, some information retrieval evaluation metrics have been built on top of a user model. In this paper we bring these two directions together and propose a common approach to converting any click model into an evaluation metric. We then put the resulting model-based metrics as well as traditional metrics (like DCG or Precision) into a common evaluation framework and compare them along a number of dimensions. One of …
