Efficient sort-based skyline evaluation

作者: Ilaria Bartolini , Paolo Ciaccia , Marco Patella

DOI: 10.1145/1412331.1412343

关键词: sortMINCSet (abstract data type)TupleSkylineFunction (mathematics)AlgorithmComputer scienceSortingRelation (database)Information Systems

摘要: Skyline queries compute the set of Pareto-optimal tuples in a relation, that is, those are not dominated by any other tuple same relation. Although several algorithms have been proposed for efficiently evaluating skyline queries, they either necessitate relation to indexed or perform dominance tests on all order determine result. In this article we introduce salsa, novel algorithm exploits idea presorting input data so as effectively limit number be read and compared. This makes salsa also attractive when executed top systems do understand semantics, logic runs clients with limited power and/or bandwidth. We prove that, if one considers symmetric sorting functions, is minimized according “minimum coordinate,” minC, criterion, performance can further improved distribution known an asymmetric function used. Experimental results obtained synthetic real datasets show consistently outperforms state-of-the-art sequential its accurately predicted.

参考文章(24)
M. Tamer Özsu, Paolo Ciaccia, Ilaria Bartolini, Vincent Oria, Integrating the Results of Multimedia Sub-Queries Using Qualitative Preferences. 10th International Workshop on Multimedia Information Systems (MIS 2004). pp. 66- 75 ,(2004)
Jarek Gryz, Ryan Shipley, Parke Godfrey, Maximal vector computation in large data sets very large data bases. pp. 229- 240 ,(2005)
Wolf-Tilo Balke, Ulrich Güntzer, Jason Xin Zheng, Efficient Distributed Skylining for Web Information Systems extending database technology. ,vol. 2992, pp. 256- 273 ,(2004) , 10.1007/978-3-540-24741-8_16
Beng Chin Ooi, Pin-Kwang Eng, Kian-Lee Tan, Efficient Progressive Skyline Computation very large data bases. pp. 301- 310 ,(2001)
Parke Godfrey, Skyline Cardinality for Relational Processing foundations of information and knowledge systems. pp. 78- 97 ,(2004) , 10.1007/978-3-540-24627-5_7
Christian Buchta, On the average number of maxima in a set of vectors Information Processing Letters. ,vol. 33, pp. 63- 65 ,(1989) , 10.1016/0020-0190(89)90156-7
Ilaria Bartolini, Paolo Ciaccia, Marco Patella, SaLSa Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06. pp. 405- 414 ,(2006) , 10.1145/1183614.1183674
Ilaria Bartolini, Paolo Ciaccia, Vincent Oria, M. Tamer Özsu, Flexible integration of multimedia sub-queries with qualitative preferences Multimedia Tools and Applications. ,vol. 33, pp. 273- 273 ,(2007) , 10.1007/S11042-007-0103-1
Jan Chomicki, Preference formulas in relational queries ACM Transactions on Database Systems. ,vol. 28, pp. 427- 466 ,(2003) , 10.1145/958942.958946
Michael I. Shamos, Franco P. Preparata, Computational Geometry: An Introduction ,(1978)