A Review on Web Pages Clustering Techniques

作者: Dipak Patel , Mukesh Zaveri

DOI: 10.1007/978-3-642-22543-7_72

关键词:

摘要: World Wide Web (WWW) has become largest source of information. This abundance information with dynamic and heterogeneous nature the web makes retrieval a difficult process for average user. A technique is required that can help users to organize, summarize browse available from goal satisfying their need effectively. Clustering organizes collection objects into related groups. page clustering key concept getting desired quickly massive storage pages on WWW. Many researchers have proposed various document techniques. In this paper, we present detail survey existing techniques along representation We also described some evaluation measures evaluate cluster qualities.

参考文章(29)
Saeed Setayeshi, Amir Masoud Rahmani, Zahra Hossaini, Link Processing for Fuzzy Web Pages Clustering and Classification ,(2009)
A. M. Natarajan, K. Premalatha, Procreant PSO for fastening the convergence to optimal solution in the application of document clustering Current Science. ,vol. 96, pp. 137- 143 ,(2009)
Ron Bekkerman, Shlomo Zilberstein, James Allan, Web page clustering using heuristic search in the web graph international joint conference on artificial intelligence. pp. 2280- 2285 ,(2007) , 10.21236/ADA457111
Junze Wang, Yijun Mo, Benxiong Huang, Jie Wen, Li He, Web Search Results Clustering Based on a Novel Suffix Tree Structure Lecture Notes in Computer Science. pp. 540- 554 ,(2008) , 10.1007/978-3-540-69295-9_43
Benjamin C. M. Fung, Martin Ester, Ke Wang, Hierarchical Document Clustering Using Frequent Itemsets siam international conference on data mining. pp. 59- 70 ,(2003)
William B. Cavnar, Using an N-Gram-based document representation with a vector processing retrieval model text retrieval conference. pp. 269- 277 ,(1994)
Cindy Xide Lin, Yintao Yu, Jiawei Han, Bing Liu, Hierarchical web-page clustering via in-page and cross-page link structures knowledge discovery and data mining. pp. 222- 229 ,(2010) , 10.1007/978-3-642-13672-6_22
Zhu Zhengyu, Han Ping, Li Lipei, Yu Chunlei, A dynamic genetic algorithm for clustering web pages international conference on software engineering. pp. 506- 511 ,(2010)
Zhong Su, Qiang Yang, Hongjiang Zhang, Xiaowei Xu, Yu-Hen Hu, Shaoping Ma, Correlation-Based Web Document Clustering for Adaptive Web Interface Design web information systems engineering. ,vol. 4, pp. 151- 167 ,(2002) , 10.1007/S101150200002
Richard C. Dubes, Anil K. Jain, Algorithms for clustering data ,(1988)