Top-K color queries for document retrieval

作者: Marek Karpinski , Yakov Nekrich

DOI: 10.5555/2133036.2133068

关键词:

摘要: In this paper we describe a new efficient (in fact optimal) data structure for the top-K color problem. Each element of an array A is assigned c with priority p(c). For query range [a, b] and value K, have to report K colors highest priorities among all that occur in A[a..b], sorted reverse order by their priorities. We show such queries can be answered O(K) time using O(N log σ) bits structure, where N number elements σ colors. Thus our asymptotically optimal respect worst-case space. As immediate application results, obtain solutions several document retrieval problems. The method could also independent interest.

参考文章(20)
Yossi Matias, S. Muthukrishnan, Süleyman Cenk Sahinalp, Jacob Ziv, Augmenting Suffix Trees, with Applications european symposium on algorithms. pp. 67- 78 ,(1998) , 10.1007/3-540-68530-8_6
Gerth Stølting Brodal, Rolf Fagerberg, Mark Greve, Alejandro López-Ortiz, Online Sorted Range Reporting international symposium on algorithms and computation. pp. 173- 182 ,(2009) , 10.1007/978-3-642-10631-6_19
Yakov Nekrich, External memory range reporting on a grid international symposium on algorithms and computation. pp. 525- 535 ,(2007) , 10.1007/978-3-540-77120-3_46
Iwona Bialynicka-Birula, Roberto Grossi, Rank-Sensitive Data Structures String Processing and Information Retrieval. pp. 79- 90 ,(2005) , 10.1007/11575832_10
Rajeev Motwani, Terry Winograd, Lawrence Page, Sergey Brin, The PageRank Citation Ranking : Bringing Order to the Web the web conference. ,vol. 98, pp. 161- 172 ,(1999)
Simon J. Puglisi, Andrew Turpin, Travis Gagie, Range Quantile Queries: Another Virtue of Wavelet Trees string processing and information retrieval. ,vol. 5721, pp. 1- 6 ,(2009) , 10.1007/978-3-642-03784-9_1
Kunihiko Sadakane, Succinct data structures for flexible text retrieval systems Journal of Discrete Algorithms. ,vol. 5, pp. 12- 22 ,(2007) , 10.1016/J.JDA.2006.03.011
Alok Aggarwal, Jeffrey,S. Vitter, The input/output complexity of sorting and related problems Communications of The ACM. ,vol. 31, pp. 1116- 1127 ,(1988) , 10.1145/48529.48535
P. Gupta, R. Janardan, M. Smid, Further results on generalized intersection searching problems: counting, reporting, and dynamization Journal of Algorithms. ,vol. 19, pp. 282- 317 ,(1995) , 10.1006/JAGM.1995.1038