Discovering better navigation sequences for the session construction problem

作者: Murat Ali Bayir , Ismail Hakki Toroslu , Murat Demirbas , Ahmet Cosar

DOI: 10.1016/J.DATAK.2011.11.005

关键词: Information retrievalGraph (abstract data type)Page viewWeb accessWeb pageSession IDComputer scienceGraph problemWorld Wide WebStatic web pageWeb navigation

摘要: In this paper, we propose a novel page view based session model and construction method to address the Web Usage Mining (WUM) problem. Unlike simple models, where sessions are sequences of web pages requested from server (or served browser/proxy cache) viewed in browser (which may not guarantee direct relationship between subsequent session), define more realistic which is set paths traversed graph that corresponds user navigation performed by following links on pages. We process raw logs as new problem present algorithm, Smart-SRA (Smart Session Reconstruction Algorithm), solve efficiently. An experimental evaluation data collected real access scenarios showed produces accurate than methods found literature.

参考文章(35)
Pablo E. Román, Gastón L’Huillier, Juan D. Velásquez, Web Usage Mining Advanced Techniques in Web Intelligence - I. pp. 143- 165 ,(2010) , 10.1007/978-3-642-14461-5_6
Yongjian Fu, Ming-Yi Shih, A Framework for Personal Web Usage Mining. international conference on internet computing. pp. 595- 600 ,(2002)
Robert Walker Cooley, Jaideep Srivastava, Web usage mining: discovery and application of interesting patterns from web data University of Minnesota. ,(2000)
Robert F. Dell, Pablo E. Román, Juan D. Velásquez, Web User Session Reconstruction with Back Button Browsing Knowledge-Based and Intelligent Information and Engineering Systems. pp. 326- 332 ,(2009) , 10.1007/978-3-642-04595-0_40
José Borges, Mark Levene, Generating Dynamic Higher-Order Markov Models in Web Usage Mining Knowledge Discovery in Databases: PKDD 2005. pp. 34- 45 ,(2005) , 10.1007/11564126_9
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Mohammed J. Zaki, SPADE: An Efficient Algorithm for Mining Frequent Sequences Machine Learning. ,vol. 42, pp. 31- 60 ,(2001) , 10.1023/A:1007652502315
Robert Cooley, Pang-Ning Tan, Jaideep Srivastava, Discovery of Interesting Usage Patterns from Web Data Web Usage Analysis and User Profiling. pp. 163- 182 ,(2000) , 10.1007/3-540-44934-5_10
Sylvain Brohée, Jacques van Helden, Evaluation of clustering algorithms for protein-protein interaction networks BMC Bioinformatics. ,vol. 7, pp. 488- 488 ,(2006) , 10.1186/1471-2105-7-488
Murat Ali Bayir, Tacettin Dogacan Guney, Tolga Can, Integration of topological measures for eliminating non-specific interactions in protein interaction networks Discrete Applied Mathematics. ,vol. 157, pp. 2416- 2424 ,(2009) , 10.1016/J.DAM.2008.06.034