作者: B R Prakash , M. Hanumanthappa
DOI:
关键词: Computer science 、 Algorithm 、 Snippet 、 Cluster analysis 、 Cluster labeling 、 GRASP 、 Information overload 、 Word-sense disambiguation 、 Document clustering 、 Information retrieval
摘要: Document clustering is an effective tool to manage information overload. By grouping similardocuments together, we enable a human observer quickly browse large document collections,make it possible easily grasp the distinct topics and subtopics. In this Paper survey most important problems techniques relatedto text retrieval: pre-processing filtering, word sense disambiguation,Further present using Modified FPF algorithm comparison of our algorithms against FPF, which isthe used in context. Further introducethe problem cluster labeling: Cluster labeling achieved by combining intra-clusterand