Mining history of changes to web access patterns

作者: Qiankun Zhao , Sourav S. Bhowmick

DOI: 10.1007/978-3-540-30116-5_53

关键词: Web miningSnapshot (computer storage)Tree traversalBusiness intelligenceWorld Wide WebScalabilityWeb accessPersonalizationTimestampComputer science

摘要: Recently, a lot of work has been done in web usage mining [2]. Among them, frequent Web Access Pattern (WAP) is the most well researched issue [1]. The idea to transform logs into sequences events with user identifications and timestamps, then extract association sequential patterns from data certain metrics. WAPs have applied wide range applications such as personalization, system improvement, site modification, business intelligence, characterization However, existing techniques focus only on WAP snapshot data, while dynamic real life. While are useful many applications, knowledge hidden behind historical changes which reflects how change, also critical adaptive web, maintenance, etc.In this paper, we propose novel approach discover WAPs. Rather than focusing occurrence WAPs, frequently changing access patterns. We define type knowledge, Frequent Mutating (FM-WAP), based FM-WAP process consists three phases. Firstly, represented set trees partitioned sequence groups ( subsets trees) according user-defined calendar pattern, where each group forest. Consequently, log by forests called history. Then, among history detected stored global Finally, extracted traversal Extensive experiments show that our proposed can produce efficiently good scalability.

参考文章(2)
Jian Pei, Jiawei Han, Behzad Mortazavi-asl, Hua Zhu, Mining Access Patterns Efficiently from Web Logs pacific asia conference on knowledge discovery and data mining. pp. 396- 407 ,(2000) , 10.1007/3-540-45571-X_47
Jaideep Srivastava, Robert Cooley, Mukund Deshpande, Pang-Ning Tan, Web usage mining ACM SIGKDD Explorations Newsletter. ,vol. 1, pp. 12- 23 ,(2000) , 10.1145/846183.846188