Scalable skyline computation using a balanced pivot selection technique

作者: Jongwuk Lee , Seung-won Hwang

DOI: 10.1016/J.IS.2013.05.005

关键词:

摘要: Skyline queries have recently received considerable attention as an alternative decision-making operator in the database community. The conventional skyline algorithms primarily focused on optimizing dominance of points order to remove non-skyline efficiently possible, but neglected take into account incomparability bypass unnecessary comparisons. To design a scalable algorithm, we first analyze cost model that copes with both and incomparability, develop novel technique select cost-optimal point, called pivot minimizes number comparisons point-based space partitioning. We then implement proposed point selection existing sorting- partitioning-based algorithms. For insertions/deletions, also discuss how maintain current using skytree, derived from recursive Furthermore, efficient greedy algorithm for k representative skytree. Experimental results demonstrate are significantly faster than state-of-the-art

参考文章(38)
Jarek Gryz, Ryan Shipley, Parke Godfrey, Maximal vector computation in large data sets very large data bases. pp. 229- 240 ,(2005)
Wolf-Tilo Balke, Ulrich Güntzer, Jason Xin Zheng, Efficient Distributed Skylining for Web Information Systems extending database technology. ,vol. 2992, pp. 256- 273 ,(2004) , 10.1007/978-3-540-24741-8_16
Beng Chin Ooi, Pin-Kwang Eng, Kian-Lee Tan, Efficient Progressive Skyline Computation very large data bases. pp. 301- 310 ,(2001)
Jan Chomicki, Parke Godfrey, Jarek Gryz, Dongming Liang, Skyline with Presorting: Theory and Optimizations intelligent information systems. pp. 595- 604 ,(2005) , 10.1007/3-540-32392-9_72
Ilaria Bartolini, Paolo Ciaccia, Marco Patella, Efficient sort-based skyline evaluation ACM Transactions on Database Systems. ,vol. 33, pp. 1- 49 ,(2008) , 10.1145/1412331.1412343
Sungwoo Park, Taekyung Kim, Jonghyun Park, Jinha Kim, Hyeonseung Im, Parallel Skyline Computation on Multicore Architectures 2009 IEEE 25th International Conference on Data Engineering. pp. 760- 771 ,(2009) , 10.1109/ICDE.2009.42
Jon Louis Bentley, Hsiang-Tsung Kung, Mario Schkolnick, Clark D Thompson, On the Average Number of Maxima in a Set of Vectors and Applications Journal of the ACM. ,vol. 25, pp. 536- 543 ,(1978) , 10.1145/322092.322095
Kenneth L. Clarkson, Jon L. Bentley, David B. Levine, Fast linear expected-time alogorithms for computing maxima and convex hulls symposium on discrete algorithms. pp. 179- 187 ,(1990) , 10.5555/320176.320196
Jongwuk Lee, Gae-won You, Seung-won Hwang, Personalized top-k skyline queries in high-dimensional space Information Systems. ,vol. 34, pp. 45- 61 ,(2009) , 10.1016/J.IS.2008.04.004