An OLAP-based Scalable Web Access Analysis Engine

Authors: Qiming Chen, Umeshwar Dayal, Meichun Hsu

DOI: 10.1007/3-540-44466-1_21

Abstract: Collecting and mining web log records (WLRs) from e-commerce web sites has become increasingly important for targeted marketing, promotions, and traffic analysis. In this paper, we describe a scalable data warehousing and OLAP-based engine for analyzing WLRs. We have to address several scalability and performance challenges in developing such a framework. Because an active web site may generate hundreds of millions of WLRs daily, we have to deal with huge data volumes and data flow rates. To support fine-grained analysis, e.g., of individual users' access profiles, we end up with huge, sparse data cubes defined over very large-sized dimensions (there may be hundreds of thousands of visitors to the site and tens of thousands of pages). While OLAP servers store sparse cubes quite efficiently, rolling up a very large cube can take prohibitively long. We have applied several non-traditional approaches to deal with this problem, which allow us to speed up WLR analysis by 3 orders of magnitude. Our framework supports multilevel and multidimensional pattern extraction and feature ranking and, in addition to the typical OLAP operations, supports data mining operations such as extended association rules.
