Efficient Compressed Inverted Index Skipping for Disjunctive Text-Queries

作者: Simon Jonassen , Svein Erik Bratsberg

DOI: 10.1007/978-3-642-20161-5_53

关键词:

摘要: In this paper we look at a combination of bulk-compression, partial query processing and skipping for document-ordered inverted indexes. We propose new index organization, provide an updated version the MaxScore method by Turtle Flood skipping-adapted space-limited adaptive pruning Lester et al. Both our methods significantly reduce number processed elements average latency more than three times. Our experiments with real implementation large document collection are valuable further research within optimizations.

参考文章(20)
Maxime Crochemore, Costas Iliopoulos, Marcin Kubica, Jakub Radoszewski, Wojciech Rytter, Tomasz Waleń, Extracting powers and periods in a string from its runs structure string processing and information retrieval. ,vol. 6393, pp. 258- 269 ,(2010) , 10.1007/978-3-642-16321-0_27
Nicholas Lester, Alistair Moffat, William Webber, Justin Zobel, Space-Limited ranked query evaluation using adaptive pruning web information systems engineering. pp. 470- 477 ,(2005) , 10.1007/11581062_37
Paolo Boldi, Sebastiano Vigna, Compressed Perfect Embedded Skip Lists for Quick Inverted-Index Lookups String Processing and Information Retrieval. ,vol. 3772, pp. 25- 28 ,(2005) , 10.1007/11575832_3
Stefan Büttcher, Ian Soboroff, Charles L. A. Clarke, The TREC 2006 Terabyte Track text retrieval conference. ,(2006)
Flavio Chierichetti, Silvio Lattanzi, Federico Mari, Alessandro Panconesi, On placing skips optimally in expectation web search and data mining. pp. 15- 24 ,(2008) , 10.1145/1341531.1341537
Howard Turtle, James Flood, Query evaluation: strategies and optimizations Information Processing and Management. ,vol. 31, pp. 831- 850 ,(1995) , 10.1016/0306-4573(95)00020-H
Chris Buckley, Alan F. Lewit, Optimization of inverted vector searches Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '85. pp. 97- 110 ,(1985) , 10.1145/253495.253515
Jiangong Zhang, Xiaohui Long, Torsten Suel, Performance of compressed inverted list caching in search engines Proceeding of the 17th international conference on World Wide Web - WWW '08. pp. 387- 396 ,(2008) , 10.1145/1367497.1367550
Trevor Strohman, W. Bruce Croft, Efficient document retrieval in main memory international acm sigir conference on research and development in information retrieval. pp. 175- 182 ,(2007) , 10.1145/1277741.1277774
Trevor Strohman, Howard Turtle, W. Bruce Croft, Optimization strategies for complex queries international acm sigir conference on research and development in information retrieval. pp. 219- 225 ,(2005) , 10.1145/1076034.1076074