Supporting personalized top-k skyline queries using partial compressed skycube

作者: Jongwuk Lee , Gae-won You , IkChan Sohn , Seung-won Hwang , Kwangil Ko

DOI: 10.1145/1316902.1316914

关键词:

摘要: As near-infinite amount of data are becoming accessible on the Web, it is getting more and important to support intelligent query mechanisms, help each user identify ideal results manageable size. such mechanism, skyline queries have gained a lot attention lately for its intuitive formulation. This intuitiveness, however, has side-effect generating too many results, especially high-dimensional data, satisfy wide range user's needs. Our goal personalized as identifying "truly interesting" objects based user-specific preference retrieval size k. While this problem been studied previously, proposed solution identifies top-k by navigating "skycube", which incurs exponential storage overhead dimensionality excessive one-time computational skycube construction. In contrast, we develop novel techniques significantly reduce both computation overhead. extensive evaluation validate framework real-life synthetic data.

参考文章(21)
Jarek Gryz, Ryan Shipley, Parke Godfrey, Maximal vector computation in large data sets very large data bases. pp. 229- 240 ,(2005)
Beng Chin Ooi, Pin-Kwang Eng, Kian-Lee Tan, Efficient Progressive Skyline Computation very large data bases. pp. 301- 310 ,(2001)
Parke Godfrey, Skyline Cardinality for Relational Processing foundations of information and knowledge systems. pp. 78- 97 ,(2004) , 10.1007/978-3-540-24627-5_7
Jon Louis Bentley, Hsiang-Tsung Kung, Mario Schkolnick, Clark D Thompson, On the Average Number of Maxima in a Set of Vectors and Applications Journal of the ACM. ,vol. 25, pp. 536- 543 ,(1978) , 10.1145/322092.322095
H. T. Kung, F. Luccio, F. P. Preparata, On Finding the Maxima of a Set of Vectors Journal of the ACM. ,vol. 22, pp. 469- 476 ,(1975) , 10.1145/321906.321910
Dimitris Papadias, Yufei Tao, Greg Fu, Bernhard Seeger, An optimal and progressive algorithm for skyline queries international conference on management of data. pp. 467- 478 ,(2003) , 10.1145/872757.872814
Jian Pei, Ada Wai-Chee Fu, Xuemin Lin, Haixun Wang, Computing Compressed Multidimensional Skyline Cubes Efficiently international conference on data engineering. pp. 96- 105 ,(2007) , 10.1109/ICDE.2007.367855
Chee-Yong Chan, H. V. Jagadish, Kian-Lee Tan, Anthony K. H. Tung, Zhenjie Zhang, Finding k-dominant skylines in high dimensional space international conference on management of data. pp. 503- 514 ,(2006) , 10.1145/1142473.1142530
Xuemin Lin, Yidong Yuan, Qing Zhang, Ying Zhang, Selecting Stars: The k Most Representative Skyline Operator international conference on data engineering. pp. 86- 95 ,(2007) , 10.1109/ICDE.2007.367854
Donald Kossmann, Frank Ramsak, Steffen Rost, Shooting stars in the sky: an online algorithm for skyline queries very large data bases. pp. 275- 286 ,(2002) , 10.1016/B978-155860869-6/50032-9