Optimizing query execution for variable-aligned length compression of bitmap indices

作者: Ryan Slechta , Jason Sawin , Ben McCamish , David Chiu , Guadalupe Canahuate

DOI: 10.1145/2628194.2628252

关键词:

摘要: Indexing is a fundamental mechanism for efficient data access. Recently, we proposed the Variable-Aligned Length (VAL) bitmap index encoding framework, which generalizes commonly used word-aligned compression techniques. VAL presented variable-aligned allows columns of to be compressed using different lengths. This flexibility creates tunable that balances trade-off between space and query processing time. The variable format presents several unique opportunities optimization. In this paper explore multiple algorithms optimize both point queries range in VAL. particular, propose dynamic encoding-length translation heuristic process queries. For queries, column orderings based on bitmap's metadata: largest segment length first (lsf), size (size), weighted (ws). our empirical study over real synthetic sets, show selection scheme produces execution times only 3.5% below optimal. We also found ordering significantly consistently out-performs other Finally, scale sets are row-ordered.

参考文章(19)
Fabian Corrales, David Chiu, Jason Sawin, Variable Length Compression for Bitmap Indices Lecture Notes in Computer Science. pp. 381- 395 ,(2011) , 10.1007/978-3-642-23091-2_32
Frank Olken, Doron Rotem, Linda Wong, Hsiu-Fen Liu, Harry K. T. Wong, Bit transposed files very large data bases. pp. 448- 457 ,(1985)
Kesheng Wu, Ekow Otoo, Arie Shoshani, On the performance of bitmap indices for high cardinality attributes very large data bases. pp. 24- 35 ,(2004) , 10.1016/B978-012088469-8.50006-1
Tan Apaydin, Ali Şaman Tosun, Hakan Ferhatosmanoglu, Analysis of Basic Data Reordering Techniques statistical and scientific database management. pp. 517- 524 ,(2008) , 10.1007/978-3-540-69497-7_33
G. Antoshenkov, Byte-aligned bitmap compression data compression conference. pp. 476- 476 ,(1995) , 10.1109/DCC.1995.515586
Kamel Aouiche, Owen Kaser, Daniel Lemire, Sorting improves word-aligned bitmap indexes data and knowledge engineering. ,vol. 69, pp. 3- 28 ,(2010) , 10.1016/J.DATAK.2009.08.006
Rishi Rakesh Sinha, Marianne Winslett, Multi-resolution bitmap indexes for scientific data ACM Transactions on Database Systems. ,vol. 32, pp. 16- ,(2007) , 10.1145/1272743.1272746
Francesco Fusco, Marc Ph. Stoecklin, Michail Vlachos, NET-FLi Proceedings of the VLDB Endowment. ,vol. 3, pp. 1382- 1393 ,(2010) , 10.14778/1920841.1921011
Alessandro Colantonio, Roberto Di Pietro, Concise: Compressed 'n' Composable Integer Set Information Processing Letters. ,vol. 110, pp. 644- 650 ,(2010) , 10.1016/J.IPL.2010.05.018