A clustering algorithm for asymmetrically related data with applications to text mining

作者: K. Krishna , Raghu Krishnapuram

DOI: 10.1145/502585.502694

关键词:

摘要: Clustering techniques find a collection of subsets data set such that the satisfies criterion is dependent on relation defined set. The underlying traditionally assumed to be symmetric. However, there exist many practical scenarios where asymmetric. One example an asymmetric in text analysis inclusion relation, i.e., meaning block another block. In this paper, we consider general problem clustering asymmetrically related and propose algorithm cluster data. To demonstrate its usefulness, two applications mining: (1) summarization short documents, (2) generation concept hierarchy from documents. Our experiments show performance proposed superior more traditional algorithms.

参考文章(4)
Gerard Salton, Michael J. McGill, Introduction to Modern Information Retrieval ,(1983)
William B. Frakes, Ricardo Baeza-Yates, Information Retrieval: Data Structures and Algorithms ,(1992)
Mark Sanderson, Bruce Croft, Deriving concept hierarchies from text international acm sigir conference on research and development in information retrieval. pp. 206- 213 ,(1999) , 10.1145/312624.312679
John A. Hartigan, Clustering Algorithms ,(1975)