A Probabilistic Clustering Model for Variables of Mixed Type

Author: Johann Bacher

DOI: 10.1023/A:1004759101388

Keywords: Probabilistic clustering, Mathematics, Statistics, Model selection, Latent class model, Probabilistic latent semantic analysis, Probabilistic relevance model, Statistical hypothesis testing, Local independence, Latent variable model

Abstract: This paper develops a probabilistic clustering model for mixed data. The model allows analysis of variables of mixed type: the variables may be nominal, ordinal and/or quantitative. The model contains the well-known models of latent class analysis as submodels. As in latent class analysis, local independence is assumed. The parameters are estimated by the EM algorithm. Test statistics and goodness-of-fit measures are proposed for model selection. Two artificial data sets show the usefulness of these tests. An empirical example completes the presentation.
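
To make the estimation idea in the abstract concrete, the following is a minimal sketch of EM for a latent class mixture under local independence. It is not the paper's implementation. It assumes only one nominal variable (categorical within each class) and one quantitative variable (normal within each class); ordinal variables and the paper's test statistics are left out. The function name `em_mixed`, the quantile-based starting values, and the synthetic data are all hypothetical choices made for illustration.

```python
import math
import random


def em_mixed(data, n_classes, n_categories, n_iter=50):
    """Fit a latent class mixture to (category, value) pairs by EM.

    Hypothetical helper: one nominal and one quantitative variable, assumed
    independent of each other within a class (local independence).
    """
    n = len(data)
    xs = sorted(x for _, x in data)
    mean_all = sum(xs) / n
    var_all = sum((x - mean_all) ** 2 for x in xs) / n
    # Deterministic start (an assumption): equal class sizes, uniform category
    # probabilities, class means at evenly spaced quantiles of the values.
    pi = [1.0 / n_classes] * n_classes
    p = [[1.0 / n_categories] * n_categories for _ in range(n_classes)]
    mu = [xs[int((k + 0.5) * n / n_classes)] for k in range(n_classes)]
    var = [var_all] * n_classes
    loglik_trace = []
    for _ in range(n_iter):
        # E-step: posterior class probabilities. Local independence lets the
        # class-conditional density factor into categorical times normal.
        resp = []
        loglik = 0.0
        for c, x in data:
            joint = [
                pi[k] * p[k][c]
                * math.exp(-0.5 * (x - mu[k]) ** 2 / var[k])
                / math.sqrt(2.0 * math.pi * var[k])
                for k in range(n_classes)
            ]
            total = sum(joint)
            loglik += math.log(total)
            resp.append([j / total for j in joint])
        loglik_trace.append(loglik)
        # M-step: responsibility-weighted proportions, means and variances.
        for k in range(n_classes):
            nk = sum(r[k] for r in resp)
            pi[k] = nk / n
            for cat in range(n_categories):
                p[k][cat] = sum(
                    r[k] for r, (c, _) in zip(resp, data) if c == cat
                ) / nk
            mu[k] = sum(r[k] * x for r, (_, x) in zip(resp, data)) / nk
            wss = sum(r[k] * (x - mu[k]) ** 2 for r, (_, x) in zip(resp, data))
            var[k] = max(wss / nk, 1e-6)  # small floor guards against collapse
    return pi, p, mu, var, loglik_trace


# Synthetic two-class data, generated only for this illustration.
rng = random.Random(0)
data = [(0 if rng.random() < 0.8 else 1, rng.gauss(0.0, 1.0)) for _ in range(200)]
data += [(1 if rng.random() < 0.8 else 0, rng.gauss(5.0, 1.0)) for _ in range(200)]
pi, p, mu, var, trace = em_mixed(data, n_classes=2, n_categories=2)
print("class proportions:", pi)
print("class means:", mu)
```

The log-likelihood values collected in `trace` should not decrease from one iteration to the next, which is a useful sanity check on any EM implementation. Comparing fits across different values of `n_classes` would be the starting point for the kind of model selection the abstract mentions, although the specific statistics proposed in the paper are not reproduced here.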
