A Framework for Identifying Skylines over Incomplete Data

作者: Ali A. Alwan , Hamidah Ibrahim , Nur Izura Udzir

DOI: 10.1109/ACSAT.2014.21

关键词:

摘要: Skyline queries provide a flexible query operator that returns data items (skylines) which are not being dominated by other in all dimensions (attributes) of the database. Most existing skyline techniques determine skylines assuming values for every item available (complete). However, this assumption is always true particularly multidimensional database as some may be missing. The incompleteness leads to loss transitivity property technique and results into failure test dominance incomparable each other. Furthermore, influences negatively on process finding skylines, leading high overhead, due exhaustive pair wise comparisons between items. This paper proposed framework incomplete with aim avoiding issue cyclic deriving skylines. identifying consists four components, namely: Data Clustering Builder, Group Constructor Local Skylines Identifier, k-dom Generator, Incomplete Identifier. Including these processes has optimized reducing necessary number comparison through eliminating early possible before applying technique.

参考文章(37)
Nur Izura Udzir, Nurul Husna Mohd Saad, Hamidah Ibrahim, Fatimah Sidi, Chik Yip Tan, Ali Amer Alwan, Performance Evaluation of Preference Queries Techniques over a High Multidimensional Database Digital Information Research Foundation. ,(2011)
Zhenhua Huang, Wei Wang, A Novel Incremental Maintenance Algorithm of SkyCube Lecture Notes in Computer Science. pp. 781- 790 ,(2006) , 10.1007/11827405_76
Surajit Chaudhuri, Luis Gravano, Evaluating Top-k Selection Queries very large data bases. pp. 397- 410 ,(1999)
Jarek Gryz, Ryan Shipley, Parke Godfrey, Maximal vector computation in large data sets very large data bases. pp. 229- 240 ,(2005)
Martin Theobald, Gerhard Weikum, Ralf Schenkel, Top-k query evaluation with probabilistic guarantees very large data bases. pp. 648- 659 ,(2004) , 10.1016/B978-012088469-8.50058-9
Beng Chin Ooi, Pin-Kwang Eng, Kian-Lee Tan, Efficient Progressive Skyline Computation very large data bases. pp. 301- 310 ,(2001)
Ilaria Bartolini, Paolo Ciaccia, Marco Patella, SaLSa Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06. pp. 405- 414 ,(2006) , 10.1145/1183614.1183674
Parisa Haghani, Sebastian Michel, Karl Aberer, Evaluating top-k queries over incomplete data streams Proceeding of the 18th ACM conference on Information and knowledge management - CIKM '09. pp. 877- 886 ,(2009) , 10.1145/1645953.1646064
Zhenhua Huang, Shengli Sun, Wei Wang, Efficient mining of skyline objects in subspaces over data streams Knowledge and Information Systems. ,vol. 22, pp. 159- 183 ,(2010) , 10.1007/S10115-008-0185-8
Jongwuk Lee, Gae-won You, Seung-won Hwang, Personalized top-k skyline queries in high-dimensional space Information Systems. ,vol. 34, pp. 45- 61 ,(2009) , 10.1016/J.IS.2008.04.004