Efficient incremental constrained clustering

作者: Ian Davidson , S. S. Ravi , Martin Ester

DOI: 10.1145/1281192.1281221

关键词:

摘要: Clustering with constraints is an emerging area of data mining research. However, most work assumes that the are given as one large batch. In this paper we explore situation where incrementally given. way user after seeing a clustering can provide positive and negative feedback via to critique solution. We consider problem efficiently updating satisfy new old rather than reclustering entire set. show incremental under NP-hard in general, but identify several sufficient conditions which lead solvable versions. These translate into set rules on types thatcan be added constraint properties must maintained. demonstrate approach more efficient re-clustering has other advantages.

参考文章(13)
S. S. Ravi, Ian Davidson, Identifying and generating easy sets of constraints for clustering national conference on artificial intelligence. pp. 336- 341 ,(2006)
Douglas Brent West, Introduction to Graph Theory ,(1995)
Rich Caruana, Andrew Kachites McCallum, David Cohn, Semi-Supervised Clustering with User Feedback Cornell University. ,(2003)
S. S. Ravi, Ian Davidson, Clustering with Constraints: Feasibility Issues and the k-Means Algorithm. siam international conference on data mining. pp. 138- 149 ,(2005)
Ian Davidson, S. S. Ravi, The complexity of non-hierarchical clustering with instance and cluster level constraints Data Mining and Knowledge Discovery. ,vol. 14, pp. 25- 61 ,(2007) , 10.1007/S10618-006-0053-7
William M. Rand, Objective Criteria for the Evaluation of Clustering Methods Journal of the American Statistical Association. ,vol. 66, pp. 846- 850 ,(1971) , 10.1080/01621459.1971.10482356
Sandy Irani, Coloring inductive graphs on-line Algorithmica. ,vol. 11, pp. 53- 72 ,(1994) , 10.1007/BF01294263
Ian Davidson, Kiri L. Wagstaff, Sugato Basu, Measuring Constraint-Set Utility for Partitional Clustering Algorithms Lecture Notes in Computer Science. pp. 115- 126 ,(2006) , 10.1007/11871637_15
Claire Cardie, Kiri Wagstaff, Clustering with Instance-level Constraints international conference on machine learning. pp. 1103- 1110 ,(2000)