A survey of Web clustering engines

Authors: Claudio Carpineto, Stanislaw Osiński, Giovanni Romano, Dawid Weiss

DOI: 10.1145/1541880.1541884

Abstract: Web clustering engines organize search results by topic, thus offering a complementary view to the flat-ranked list returned by conventional search engines. In this survey, we discuss the issues that must be addressed in the development of a Web clustering engine, including acquisition and preprocessing of search results, their clustering and visualization. Search results clustering, the core of the system, has specific requirements that cannot be addressed by classical clustering algorithms. We emphasize the role played by the quality of the cluster labels as opposed to optimizing only the clustering structure. We highlight the main characteristics of a number of existing Web clustering engines and also discuss how to evaluate their retrieval performance. Some directions for future research are finally presented.
