Separating the swarm

Authors: Jeffrey Heer, Ed H. Chi

DOI: 10.1145/503376.503420

Abstract: Understanding user behaviors on Web sites enables site owners to make sites more usable, ultimately helping users achieve their goals more quickly. Accordingly, researchers have devised methods for categorizing user sessions in hopes of revealing user interests. These techniques build user profiles by combining users' navigation paths with other data features, such as page viewing time, hyperlink structure, and page content. Previously, we presented complex techniques combining many of these features to cluster user profiles. In this paper, we introduce a user study and a systematic evaluation of different data features and their associated weighting schemes. We present the results of our study, including accuracy measures for a number of clustering approaches, and offer recommendations for Web analysts. While further investigation over more Web sites is needed to definitively settle on a robust scheme, we have characterized the analytic space.

References (18)
JZ Huang, DWL Cheung, KP Ng, WK Ching, KK Ng. A Cube Model for Web Access Sessions and Cluster Analysis (2001).
Robert Walker Cooley, Jaideep Srivastava. Web usage mining: discovery and application of interesting patterns from web data. University of Minnesota (2000).
Myra Spiliopoulou, Carsten Pohle, Lukas C. Faulstich. Improving the Effectiveness of a Web Site with Web Usage Mining. Web Usage Analysis and User Profiling, pp. 142-162 (2000). DOI: 10.1007/3-540-44934-5_9
Joshua Zhexue Huang, Michael Ng, Wai-Ki Ching, Joe Ng, David Cheung. A Cube Model and Cluster Analysis for Web Access Sessions. WEBKDD '01 Revised Papers from the Third International Workshop on Mining Web Log Data Across All Customers Touch Points, pp. 48-67 (2001). DOI: 10.1007/3-540-45640-6_3
Arindam Banerjee, Joydeep Ghosh. Clickstream clustering using weighted longest common subsequences. Proceedings of the Web Mining Workshop at the 1st SIAM Conference on Data Mining (2001).
Peter L.T. Pirolli, James E. Pitkow. Distributions of surfers' paths through the World Wide Web: Empirical characterizations. World Wide Web, vol. 2, pp. 29-45 (1999). DOI: 10.1023/A:1019288403823
Christopher D. Manning, Hinrich Schütze. Foundations of Statistical Natural Language Processing (1999).
Yongjian Fu, Kanwalpreet Sandhu, Ming-Yi Shih. A Generalization-Based Approach to Clustering of Web Usage Sessions. Web Usage Analysis and User Profiling, pp. 21-38 (2000). DOI: 10.1007/3-540-44934-5_2
O.R. Zaiane, Man Xin, Jiawei Han. Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs. Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries (ADL'98), pp. 19-29 (1998). DOI: 10.1109/ADL.1998.670376
Peter Pirolli, James Pitkow. Mining longest repeating subsequences to predict World Wide Web surfing. USENIX Symposium on Internet Technologies and Systems, p. 13 (1999).