Incorporating SAT solvers into hierarchical clustering algorithms

作者: Sean Gilpin , Ian Davidson

DOI: 10.1145/2020408.2020585

关键词:

摘要: The area of constrained clustering has been actively pursued for the last decade. A more recent extension that will be focus this paper is hierarchical which allows building user-constrained dendrograms/trees. Like all forms clustering, previous work on uses simple constraints are typically implemented in a procedural language. However, there exists mature results and packages fields constraint satisfaction languages solvers field yet to explore. This marks first steps towards introducing languages/solvers into clustering. We make several significant contributions. show how many existing new can modeled as Horn-SAT problem easily solvable polynomial time their implementation any number declarative or efficient solvers. implement our own solver efficiency reasons. then formulate flexible manner so algorithms, whose output dendrogram, use constraints.

参考文章(15)
S. S. Ravi, Leonid Shamis, Ian Davidson, A SAT-based Framework for Efficient Constrained Clustering. siam international conference on data mining. pp. 94- 105 ,(2010)
Korinna Bade, Andreas Nürnberger, Creating a Cluster Hierarchy under Constraints of a Partially Known Hierarchy. siam international conference on data mining. pp. 13- 24 ,(2008)
Ian Davidson, S. S. Ravi, Agglomerative Hierarchical Clustering with Constraints: Theoretical and Empirical Results Knowledge Discovery in Databases: PKDD 2005. pp. 59- 70 ,(2005) , 10.1007/11564126_11
Hans A. Kestler, Johann M. Kraus, Günther Palm, Friedhelm Schwenker, On the Effects of Constraints in Semi-supervised Hierarchical Clustering Artificial Neural Networks in Pattern Recognition. pp. 57- 66 ,(2006) , 10.1007/11829898_6
Korinna Bade, Dominik Benz, Evaluation Strategies for Learning Algorithms of Hierarchies Advances in Data Analysis, Data Handling and Business Intelligence. pp. 83- 92 ,(2009) , 10.1007/978-3-642-01044-6_7
Richard M. Karp, Reducibility Among Combinatorial Problems Journal of Symbolic Logic. ,vol. 40, pp. 219- 241 ,(2010) , 10.1007/978-3-540-68279-0_8
Robert Endre Tarjan, Efficiency of a Good But Not Linear Set Union Algorithm Journal of the ACM. ,vol. 22, pp. 215- 225 ,(1975) , 10.1145/321879.321884
William F. Dowling, Jean H. Gallier, Linear-time algorithms for testing the satisfiability of propositional horn formulae Journal of Logic Programming. ,vol. 1, pp. 267- 284 ,(1984) , 10.1016/0743-1066(84)90014-1
Ian Davidson, S. S. Ravi, Using instance-level constraints in agglomerative hierarchical clustering: theoretical and empirical results Data Mining and Knowledge Discovery. ,vol. 18, pp. 257- 282 ,(2009) , 10.1007/S10618-008-0103-4