Skyline Query Processing for Incomplete Data

Authors: Mohamed E. Khalefa, Mohamed F. Mokbel, Justin J. Levandoski

DOI: 10.1109/ICDE.2008.4497464

Abstract: Recently, there has been much interest in processing skyline queries for various applications that include decision making, personalized services, and search pruning. Skyline queries aim to prune a search space of large numbers of multi-dimensional data items to a small set of interesting items by eliminating items that are dominated by others. Existing skyline algorithms assume that all dimensions are available for all data items. This paper goes beyond this restrictive assumption as we address the more practical case involving incomplete data items (i.e., data items missing values in some of their dimensions). In contrast to the case of complete data, where the dominance relation is transitive, incomplete data suffer from a non-transitive dominance relation, which may lead to cyclic dominance behavior. We first propose two algorithms, namely, "Replacement" and "Bucket", that use traditional skyline algorithms for incomplete data. Then, we propose the "ISkyline" algorithm, which is designed specifically for the case of incomplete data. The "ISkyline" algorithm employs two optimization techniques, namely, virtual points and shadow skylines, to tolerate cyclic dominance relations. Experimental evidence shows that the "ISkyline" algorithm significantly outperforms variations of traditional skyline algorithms.
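
The non-transitivity described in the abstract can be illustrated with a small sketch. The abstract does not state the dominance definition, so the code below assumes a common reading: one item dominates another if, on the dimensions where both have values, it is no worse everywhere and strictly better at least once, with smaller values preferred. The function names and the three sample items are hypothetical and chosen only to produce a cycle. This is not the "ISkyline" algorithm.

```python
from typing import List, Optional, Sequence

# A data item; None marks a missing value in that dimension.
Point = Sequence[Optional[float]]


def dominates(p: Point, q: Point) -> bool:
    """Assumed definition: compare only dimensions known in both items.

    p dominates q if p is <= q on every shared dimension and < q on at
    least one (smaller is better). Items with no shared dimension are
    treated as incomparable.
    """
    shared = [(x, y) for x, y in zip(p, q) if x is not None and y is not None]
    if not shared:
        return False
    return all(x <= y for x, y in shared) and any(x < y for x, y in shared)


def naive_skyline(points: List[Point]) -> List[Point]:
    """Return the items that no other item dominates (quadratic scan)."""
    return [
        p for p in points
        if not any(dominates(q, p) for q in points if q is not p)
    ]


# Hypothetical items over three dimensions.
a = (1, None, 10)
b = (3, 2, None)
c = (None, 4, 5)

print(dominates(a, b))  # a beats b on dimension 0
print(dominates(b, c))  # b beats c on dimension 1
print(dominates(a, c))  # transitivity would require this to hold
print(dominates(c, a))  # instead c beats a on dimension 2
print(naive_skyline([a, b, c]))
```

Under these assumptions, a dominates b and b dominates c, yet c dominates a, so every item is dominated by some other item and the naive skyline is empty. This is the kind of cyclic behavior that, according to the abstract, a skyline algorithm for incomplete data has to tolerate.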
