Effective Skyline Cardinality Estimation on Data Streams

作者: Yang Lu , Jiakui Zhao , Lijun Chen , Bin Cui , Dongqing Yang

DOI: 10.1007/978-3-540-85654-2_25

关键词: Data miningBloom filterCardinality (SQL statements)Computer scienceSkyline operatorData streamData stream miningSkylineQuery optimization

摘要: In order to incorporate the skyline operator into data stream engine, we need address problem of cardinality estimation, which is very important for extending query optimizer's cost model accommodate queries. this paper, propose robust approaches estimating over sliding windows in environment. We first design an approach estimate uniformly distributed data, and then extend support arbitrarily data. Our allow arbitrary distribution, hence can be applied model. To online manner, live elements window are sketched using Spectral Bloom Filters efficiently effectively capture information essential windows. Extensive experimental study demonstrates that our significantly outperform previous approaches.

参考文章(13)
Parke Godfrey, Skyline Cardinality for Relational Processing foundations of information and knowledge systems. pp. 78- 97 ,(2004) , 10.1007/978-3-540-24627-5_7
Christian Buchta, On the average number of maxima in a set of vectors Information Processing Letters. ,vol. 33, pp. 63- 65 ,(1989) , 10.1016/0020-0190(89)90156-7
Jon Louis Bentley, Hsiang-Tsung Kung, Mario Schkolnick, Clark D Thompson, On the Average Number of Maxima in a Set of Vectors and Applications Journal of the ACM. ,vol. 25, pp. 536- 543 ,(1978) , 10.1145/322092.322095
Saar Cohen, Yossi Matias, Spectral bloom filters international conference on management of data. pp. 241- 252 ,(2003) , 10.1145/872757.872787
H. T. Kung, F. Luccio, F. P. Preparata, On Finding the Maxima of a Set of Vectors Journal of the ACM. ,vol. 22, pp. 469- 476 ,(1975) , 10.1145/321906.321910
Dimitris Papadias, Yufei Tao, Greg Fu, Bernhard Seeger, An optimal and progressive algorithm for skyline queries international conference on management of data. pp. 467- 478 ,(2003) , 10.1145/872757.872814
Yufei Tao, Dimitris Papadias, Maintaining sliding window skylines on data streams IEEE Transactions on Knowledge and Data Engineering. ,vol. 18, pp. 377- 391 ,(2006) , 10.1109/TKDE.2006.48
Xuemin Lin, Yidong Yuan, Wei Wang, Hongjun Lu, Stabbing the sky: efficient skyline computation over sliding windows international conference on data engineering. pp. 502- 513 ,(2005) , 10.1109/ICDE.2005.137
Donald Kossmann, Frank Ramsak, Steffen Rost, Shooting stars in the sky: an online algorithm for skyline queries very large data bases. pp. 275- 286 ,(2002) , 10.1016/B978-155860869-6/50032-9
Burton H. Bloom, Space/time trade-offs in hash coding with allowable errors Communications of the ACM. ,vol. 13, pp. 422- 426 ,(1970) , 10.1145/362686.362692