Distributed Skycube Computation with Anthill

Authors: Renê R. Veloso, Loïc Cerf, Chedy Raïssi, Wagner Meira Jr.

DOI: 10.1109/SBAC-PAD.2011.29

Abstract: Recently, skyline queries have gained considerable attention and are among the most important tools for multi-criteria analysis. In order to process all possible combinations of criteria along with their inherent analysis, researchers introduced and studied the notion of skycube. Simply put, a skycube is a pre-materialization of all possible subspaces with their associated skylines. An efficient skycube computation relies on the detection of redundancies in the different processing steps and on enhanced result sharing between subspaces. Lately, the Orion algorithm was proposed to compute the skycube in a very efficient way. The approach relies on the derivation of skyline points over different subspaces. Nevertheless, because there are 2^{|D|} - 1 subspaces (where D is the set of dimensions) in a skycube, the running time still grows exponentially with the number of dimensions and easily becomes intractable on real-world datasets. In this study, we detail the distribution of Orion within the filter-stream framework and conduct an extensive set of experiments on large datasets collected from Twitter to demonstrate the efficiency of our method.
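
To make the definition above concrete, here is a minimal brute-force sketch of a skycube in Python. It is neither the Orion algorithm nor its Anthill distribution. It only illustrates that one skyline is materialized for each of the 2^{|D|} - 1 non-empty subspaces. The function names, the sample points, and the convention that smaller values are better are all assumptions made for illustration.

```python
from itertools import combinations


def dominates(p, q, dims):
    """Return True if p dominates q on the subspace dims.

    Assumption: smaller values are better on every dimension.
    p dominates q when it is no worse everywhere and strictly better somewhere.
    """
    return all(p[d] <= q[d] for d in dims) and any(p[d] < q[d] for d in dims)


def skyline(points, dims):
    """Naive quadratic skyline: keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p, dims) for q in points)]


def skycube(points, num_dims):
    """Map every non-empty subspace (a tuple of dimension indices) to its skyline."""
    cube = {}
    for size in range(1, num_dims + 1):
        for dims in combinations(range(num_dims), size):
            cube[dims] = skyline(points, dims)
    return cube


# Hypothetical three-dimensional dataset.
points = [(1, 4, 3), (2, 2, 2), (3, 1, 5), (4, 4, 4)]
cube = skycube(points, 3)

for dims, sky in cube.items():
    print(dims, sky)
```

Each subspace is processed independently here, so the work grows with the number of subspaces, which is the exponential cost the abstract describes. Sharing results between subspaces, as the abstract attributes to Orion, is what this naive version omits.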
