An Efficient Technique to Improve Snippet Clustering and Labeling using Modified FPF Algorithm

作者: B R Prakash , M. Hanumanthappa

DOI:

关键词: Computer scienceAlgorithmSnippetCluster analysisCluster labelingGRASPInformation overloadWord-sense disambiguationDocument clusteringInformation retrieval

摘要: Document clustering is an effective tool to manage information overload. By grouping similardocuments together, we enable a human observer quickly browse large document collections,make it possible easily grasp the distinct topics and subtopics. In this Paper survey most important problems techniques relatedto text retrieval: pre-processing filtering, word sense disambiguation,Further present using Modified FPF algorithm comparison of our algorithms against FPF, which isthe used in context. Further introducethe problem cluster labeling: Cluster labeling achieved by combining intra-clusterand

参考文章(8)
Karina Figueroa, Edgar Chávez, Gonzalo Navarro, Rodrigo Paredes, On the Least Cost for Proximity Searching in Metric Spaces Experimental Algorithms. pp. 279- 290 ,(2006) , 10.1007/11764298_26
Filippo Geraci, Mauro Leoncini, Manuela Montangero, Marco Pellegrini, M Elena Renda, None, FPF-SB: A Scalable Algorithm for Microarray Gene Expression Data Clustering Digital Human Modeling. ,vol. 4561, pp. 606- 615 ,(2007) , 10.1007/978-3-540-73321-8_69
Marco Maggini, Marco Pellegrini, Filippo Geraci, Fabrizio Sebastiani, Cluster Generation and Cluster Labelling for Web Snippets string processing and information retrieval. pp. 25- 36 ,(2006)
A. Gulli, P. Ferragina, A personalized search engine based on Web-snippet hierarchical clustering Software - Practice and Experience. ,vol. 38, pp. 189- 225 ,(2008) , 10.1002/SPE.V38:2
Flavio Chierichetti, Alessandro Panconesi, Prabhakar Raghavan, Mauro Sozio, Alessandro Tiberi, Eli Upfal, Finding near neighbors through cluster pruning symposium on principles of database systems. pp. 103- 112 ,(2007) , 10.1145/1265530.1265545
D. Crabtree, Xiaoying Gao, P. Andreae, Standardized Evaluation Method for Web Clustering Results web intelligence. pp. 280- 283 ,(2005) , 10.1109/WI.2005.138
Stanislaw Osiński, Dawid Weiss, Conceptual Clustering Using Lingo Algorithm: Evaluation on Open Directory Project Data intelligent information systems. pp. 369- 377 ,(2004) , 10.1007/978-3-540-39985-8_38