Semantic Analysis of Web Site Audience by Integrating Web Usage Mining and Web Content Mining

作者: Jean-Pierre Norguet , Esteban Zimányi , Ralf Steinberger

DOI: 10.1007/978-3-540-88081-3_4

关键词:

摘要: With the emergence of World Wide Web, analyzing and improving Web communication has become essential to adapt content visitors’ expectations. analysis is traditionally performed by analytics software, which produce long lists page-based audience metrics. These results suffer from page synonymy, polysemy, temporality, volatility. In addition, metrics contain little semantics are too detailed be exploited organization managers chief editors, who need summarized conceptual information take high-level decisions. To obtain such metrics, we propose a method based on output mining. Output mining new kind usage mining, between our method, first collect pages server. Then, for given taxonomy covering site knwoledge domain, aggregate term weights in using OLAP tools, order topic-based representing topics. demonstrate how approach solves cited problems, compute with SQL Server Analysis Service prototype WASA real sites. Finally, compare against those obtained Google Analytics, popular tool.

参考文章(22)
Sebastían A. Ríos, Juan D. Velásquez, Eduardo S. Vera, Hiroshi Yasuda, Terumasa Aoki, Using SOFM to Improve Web Site Text Content Lecture Notes in Computer Science. ,vol. 3611, pp. 622- 626 ,(2005) , 10.1007/11539117_88
Howard Lombard, Mark Sweiger, Jimmy Langston, Mark R. Madsen, Clickstream Data Warehousing ,(2002)
Alexander Maedche, Gerd Stumme, FCA-MERGE: bottom-up merging of ontologies international joint conference on artificial intelligence. pp. 225- 230 ,(2001)
Elzbieta Malinowski, Esteban Zimányi, OLAP Hierarchies: A Conceptual Perspective conference on advanced information systems engineering. ,vol. 3084, pp. 477- 491 ,(2004) , 10.1007/978-3-540-25975-6_34
Peter L.T. Pirolli, James E. Pitkow, Distributions of surfers’ paths through the World Wide Web: Empirical characterizations World Wide Web. ,vol. 2, pp. 29- 45 ,(1999) , 10.1023/A:1019288403823
Jean-Pierre Norguet, Esteban Zimányi, Ralf Steinberger, Improving web sites with web usage mining, web content mining, and semantic analysis conference on current trends in theory and practice of informatics. pp. 430- 439 ,(2006) , 10.1007/11611257_41
r;ribeiro-neto bueza-yates (b), Modern Information Retrieval ,(1999)
Adolfo Lozano-Tello, Asunción Gomez-Perez, ONTOMETRIC: A Method to Choose the Appropriate Ontology Journal of Database Management. ,vol. 15, pp. 1- 18 ,(2004) , 10.4018/JDM.2004040101
Charu G. Aggarwal, Philip S. Yu, On disk caching of Web objects in proxy servers conference on information and knowledge management. pp. 238- 245 ,(1997) , 10.1145/266714.266904