Bayesian nonparametric clustering as a community detection problem

作者: Stefano F. Tonellato

DOI: 10.1016/J.CSDA.2020.107044

关键词:

摘要: Abstract A wide class of Bayesian nonparametric priors leads to the representation distribution observable variables as a mixture density with an infinite number components. Such induces clustering structure in data. However, due label switching, cluster identification is not straightforward posteriori and some post-processing MCMC output usually required. Alternatively, observations can be mapped on weighted undirected graph, where each node represents sample item edge weights are given by posterior pairwise similarities. It shown how, after building particular random walk such it possible apply community detection algorithm, known map equation, leading minimisation expected description length partition. relevant feature this method that allows for quantification uncertainty classification.

参考文章(53)
Mario Medvedovic, Junhai Guo, Bayesian model-averaging in unsupervised learning from microarray data international conference on data mining. pp. 40- 47 ,(2004)
Tamás Nepusz, Gábor Csárdi, The igraph software package for complex network research InterJournal Complex Systems. ,vol. 1695, ,(2006)
Zoubin Ghahramani, Sara Wade, Bayesian Cluster Analysis: Point Estimation and Credible Balls Bayesian Analysis. ,vol. 13, pp. 47- 49 ,(2017) , 10.1214/17-BA1073
Sylvia Frühwirth-Schnatter, Finite Mixture and Markov Switching Models ,(2006)
Murray Aitkin, How many Components in a Finite Mixture John Wiley & Sons, Ltd. pp. 277- 292 ,(2011) , 10.1002/9781119995678.CH13
J McLachlan, G, D. Peel, Finite Mixture Models ,(2000)
Willem Waegeman, Eyke Hüllermeier, Arkadiusz Jachnik, Weiwei Cheng, Krzysztof Dembczyńki, On the bayes-optimality of F-measure maximizers Journal of Machine Learning Research. ,vol. 15, pp. 3333- 3388 ,(2014) , 10.5555/2627435.2697071
Jean-Patrick Baudry, Estimation and model selection for model-based clustering with the conditional classification likelihood Electronic Journal of Statistics. ,vol. 9, pp. 1041- 1077 ,(2015) , 10.1214/15-EJS1026
Alejandro Jara, Timothy Hanson, Fernando Quintana, Peter Müller, Gary Rosner, DPpackage: Bayesian Semi- and Nonparametric Modeling in R Journal of Statistical Software. ,vol. 40, pp. 1- 30 ,(2011) , 10.18637/JSS.V040.I05