Authors: Murat Ali Bayir, Ismail Hakki Toroslu, Murat Demirbas, Ahmet Cosar
DOI: 10.1016/J.DATAK.2011.11.005
Keywords: Information retrieval, Graph (abstract data type), Page view, Web access, Web page, Session ID, Computer science, Graph problem, World Wide Web, Static web page, Web navigation
Abstract: In this paper, we propose a novel page view based session model and session construction method to address the Web Usage Mining (WUM) problem. Unlike simple session models, where sessions are sequences of web pages requested from the server (or served from the browser/proxy cache) and viewed in the browser (which may not guarantee a direct relationship between subsequent pages in a session), we define a more realistic session model, which is a set of paths traversed in the web graph that corresponds to user navigation performed by following links on web pages. We define the process of session construction from raw logs as a new graph problem and present a new algorithm, Smart-SRA (Smart Session Reconstruction Algorithm), to solve this problem efficiently. An experimental evaluation on data collected from real access scenarios showed that Smart-SRA produces more accurate sessions than the session construction methods found in the literature.