MIMICS: A Large-Scale Data Collection for Search Clarification

作者: Nick Craswell , Hamed Zamani , Gord Lueck , Everest Chen , Flint Luu

DOI:

关键词:

摘要: Search clarification has recently attracted much attention due to its applications in search engines. It also been recognized as a major component conversational information seeking systems. Despite importance, the research community still feels lack of large-scale data for studying different aspects clarification. In this paper, we introduce MIMICS, collection datasets real web queries sampled from Bing query logs. Each MIMICS is generated by production algorithm and consists clarifying question up five candidate answers. contains three datasets: (1) MIMICS-Click includes over 400k unique queries, their associated panes, corresponding aggregated user interaction signals (i.e., clicks). (2) MIMICS-ClickExplore an exploration that 60k each with multiple panes. (3) MIMICS-Manual 2k queries. query-clarification pair dataset manually labeled at least trained annotators. graded quality labels question, answer set, landing result page answer. MIMICS publicly available purposes, thus enables researchers study number tasks related clarification, including generation selection, engagement prediction click models analyzing interactions

参考文章(28)
Luis Quintano, Irene Pimenta Rodrigues, Question/Answering Clarification Dialogues mexican international conference on artificial intelligence. pp. 155- 164 ,(2008) , 10.1007/978-3-540-88636-5_14
Hamed Zamani, Pooya Moradi, Azadeh Shakery, Adaptive User Engagement Evaluation via Multi-task Learning international acm sigir conference on research and development in information retrieval. pp. 1011- 1014 ,(2015) , 10.1145/2766462.2767785
Greg Pass, Abdur Chowdhury, Cayley Torgeson, A picture of search scalable information systems. pp. 1- ,(2006) , 10.1145/1146847.1146848
Nick Craswell, Onno Zoeter, Michael Taylor, Bill Ramsey, An experimental comparison of click position-bias models web search and data mining. pp. 87- 94 ,(2008) , 10.1145/1341531.1341545
Kalervo Järvelin, Jaana Kekäläinen, Cumulated gain-based evaluation of IR techniques ACM Transactions on Information Systems. ,vol. 20, pp. 422- 446 ,(2002) , 10.1145/582415.582418
Steve Cronen-Townsend, Yun Zhou, W. Bruce Croft, Predicting query performance Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '02. pp. 299- 306 ,(2002) , 10.1145/564376.564429
Mounia Lalmas, Elad Yom-Tov, Heather O'Brien, Measuring User Engagement ,(2014)
Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu, BLEU Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02. pp. 311- 318 ,(2001) , 10.3115/1073083.1073135
MARCO DE BONI, SURESH MANANDHAR, Implementing clarification dialogues in open domain question answering Natural Language Engineering. ,vol. 11, pp. 343- 361 ,(2005) , 10.1017/S1351324905003682
Chin-Yew Lin, Eduard Hovy, Automatic evaluation of summaries using N-gram co-occurrence statistics Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - NAACL '03. pp. 71- 78 ,(2003) , 10.3115/1073445.1073465