Tree-Structured Clustering via the Minimum Cross Entropy Principle

作者: David Miller , Kenneth Rose

DOI: 10.1007/978-94-015-8729-7_7

关键词:

摘要: We propose a new interdisciplinary approach to the tree-structured clustering problem, wherein structural constraints are imposed in order reduce classification search complexity of resulting statistical classifier. Most known methods greedy and optimize nodes tree one at time minimize local cost. By constrast, we develop joint optimization method, derived based on information-theoretic principles closely related physics. The is inspired by deterministic annealing method for unstructured clustering, which was maximum entropy inference. principle minimum cross entropy, using informative priors approximate solution while imposing constraint. As original number distinct representatives (and hence tree) grows non-heuristic fashion sequence phase transitions occur so as effective free energy Examples demonstrate considerable improvement over methods.

参考文章(26)
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
Biing-Hwang Juang, A. Gray, Multiple stage vector quantization for speech coding international conference on acoustics, speech, and signal processing. ,vol. 7, pp. 597- 600 ,(1982) , 10.1109/ICASSP.1982.1171604
Richard C. Dubes, Anil K. Jain, Algorithms for clustering data ,(1988)
E.A. Riskin, R.M. Gray, A greedy tree growing algorithm for the design of variable rate vector quantizers (image compression) IEEE Transactions on Signal Processing. ,vol. 39, pp. 2500- 2507 ,(1991) , 10.1109/78.98004
James C. Bezdek, A Convergence Theorem for the Fuzzy ISODATA Clustering Algorithms IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 2, pp. 1- 8 ,(1980) , 10.1109/TPAMI.1980.4766964
David Miller, Kenneth Rose, Hierarchical, unsupervised learning with growing via phase transitions Neural Computation. ,vol. 8, pp. 425- 450 ,(1996) , 10.1162/NECO.1996.8.2.425
Geoffrey H. Ball, David J. Hall, A clustering technique for summarizing multivariate data Behavioral Science. ,vol. 12, pp. 153- 155 ,(1967) , 10.1002/BS.3830120210
Petar D Simic, Statistical mechanics as the underlying theory of ‘elastic’ and ‘neural’ optimisations Network: Computation In Neural Systems. ,vol. 1, pp. 89- 103 ,(1990) , 10.1088/0954-898X_1_1_007
D. Geiger, F. Girosi, Parallel and deterministic algorithms from MRFs: surface reconstruction IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 13, pp. 401- 412 ,(1991) , 10.1109/34.134040