CRB-Tree: An Efficient Indexing Scheme for Range-Aggregate Queries

作者: Sathish Govindarajan , Pankaj K. Agarwal , Lars Arge

DOI: 10.1007/3-540-36285-1_10

关键词: Spatial databaseSearch engine indexingAggregate (data warehouse)Tree (graph theory)SargableSet (abstract data type)Computer scienceAlgorithmQuery optimization

摘要: We propose a newindexing scheme, called the CRB-tree, for efficiently answering range-aggregate queries. The problem is defined as follows: Given set of weighted points in Rd, compute aggregate weights that lie inside d-dimensional query rectangle. In this paper we focus on range-COUNT, SUM, AVG aggregates. First, develop an indexing scheme two-dimensional range-COUNT queries uses O(N/B) disk blocks and answers O(logB N) I/Os, where N number input B block size. This first optimal index structure 2D problem. can be extended to obtain near-linear-size range-SUM using I/Os. also similar bounds rectangle-intersection queries, which rectangles asks those overlap with result immediately improves recent temporal-aggregate Our dynamized higher dimensions. Finally, demonstrate practical efficiency our by comparing its performance against kdB-tree. For dataset around 100 million points, CRB-tree time 8-10 times faster than kdB-tree time. Furthermore, unlike other schemes, oblivious distribution placement, shape size

参考文章(27)
Steven Geffner, Divakant Agrawal, Amr El Abbadi, The Dynamic Data Cube extending database technology. pp. 237- 253 ,(2000) , 10.1007/3-540-46439-5_17
Yufei Tao, Dimitris Papadias, Jun Zhang, Aggregate Processing of Planar Points extending database technology. pp. 682- 700 ,(2002) , 10.1007/3-540-45876-X_42
J.-R. Sack, J. Urrutia, Handbook of computational geometry North-Holland Publishing Co.. ,(2000)
Lars Arge, Octavian Procopiuc, Jeffrey Scott Vitter, Implementing I/O-efficient Data Structures Using TPIE european symposium on algorithms. pp. 88- 100 ,(2002) , 10.1007/3-540-45749-6_12
Jong Soo Kim, Sung Tak Kang, Myoung Ho Kim, Effective Temporal Aggregation Using Point-Based Trees database and expert systems applications. pp. 1018- 1030 ,(1999) , 10.1007/3-540-48309-8_96
Yannis E. Ioannidis, Chee Yong Chan, Hierarchical Prefix Cubes for Range-Sum Queries very large data bases. pp. 675- 686 ,(1999)
Lars Arge, Jan Vahrenhold, I/O-efficient dynamic planar point location (extended abstract) symposium on computational geometry. pp. 191- 200 ,(2000) , 10.1145/336154.336205
Jeffrey Scott Vitter, Min Wang, Approximate computation of multidimensional aggregates of sparse data using wavelets ACM SIGMOD Record. ,vol. 28, pp. 193- 204 ,(1999) , 10.1145/304181.304199
Nick Roussopoulos, Yannis Kotidis, Mema Roussopoulos, Cubetree: organization of and bulk incremental updates on the data cube international conference on management of data. ,vol. 26, pp. 89- 99 ,(1997) , 10.1145/253260.253276
Alok Aggarwal, Jeffrey,S. Vitter, The input/output complexity of sorting and related problems Communications of The ACM. ,vol. 31, pp. 1116- 1127 ,(1988) , 10.1145/48529.48535