Bayesian model-averaging in unsupervised learning from microarray data

作者: Mario Medvedovic , Junhai Guo

DOI:

关键词:

摘要: Unsupervised identification of patterns in microarray data has been a productive approach to uncovering relationships between genes and the biological process which they are involved. Traditional model-based clustering approaches as well some recently developed mining for integrating genomic functional rely on one's ability determine correct number clusters or modules data. In this paper we demonstrate that performance such methods general can be significantly improved by accounting uncertainties inherent identifying optimal We Bayesian averaging via infinite mixture model offers more robust than traditional finite is determined using Information Criterion. This improvement demonstrated through simulation study analysis relatively large dataset. Finally, describe novel heuristic modification Gibbs sampler used fit mode effectively deals with issues slow mixing.

参考文章(24)
Chris Fraley, Adrian E Raftery, Model-Based Clustering, Discriminant Analysis, and Density Estimation Journal of the American Statistical Association. ,vol. 97, pp. 611- 631 ,(2002) , 10.1198/016214502760047131
Justin Lamb, Sridhar Ramaswamy, Heide L. Ford, Bernardo Contreras, Robert V. Martinez, Frances S. Kittrell, Cynthia A. Zahnow, Nick Patterson, Todd R. Golub, Mark E. Ewen, A mechanism of cyclin D1 action encoded in the patterns of gene expression in human cancer. Cell. ,vol. 114, pp. 323- 334 ,(2003) , 10.1016/S0092-8674(03)00570-1
Gilles Celeux, Merrilee Hurn, Christian P. Robert, Computational and Inferential Difficulties with Mixture Posterior Distributions Journal of the American Statistical Association. ,vol. 95, pp. 957- 970 ,(2000) , 10.1080/01621459.2000.10474285
Thomas S. Ferguson, A Bayesian Analysis of Some Nonparametric Problems Annals of Statistics. ,vol. 1, pp. 209- 230 ,(1973) , 10.1214/AOS/1176342360
Mario Medvedovic, Paul Succop, Rakesh Shukla, Kathleen Dixon, Clustering mutational spectra via classification likelihood and markov chain monte carlo algorithms Journal of Agricultural Biological and Environmental Statistics. ,vol. 6, pp. 19- 37 ,(2001) , 10.1198/108571101300325111
Radford M. Neal, Markov Chain Sampling Methods for Dirichlet Process Mixture Models Journal of Computational and Graphical Statistics. ,vol. 9, pp. 249- 265 ,(2000) , 10.1080/10618600.2000.10474879
Alan E. Gelfand, Adrian F. M. Smith, Sampling-Based Approaches to Calculating Marginal Densities Journal of the American Statistical Association. ,vol. 85, pp. 398- 409 ,(1990) , 10.1080/01621459.1990.10476213
Sabine Fessele, Holger Maier, Christian Zischek, Peter J Nelson, Thomas Werner, Regulatory context is a crucial part of gene function. Trends in Genetics. ,vol. 18, pp. 60- 63 ,(2002) , 10.1016/S0168-9525(02)02591-X
M. Medvedovic, K.Y. Yeung, R.E. Bumgarner, Bayesian mixture model based clustering of replicated microarray data Bioinformatics. ,vol. 20, pp. 1222- 1232 ,(2004) , 10.1093/BIOINFORMATICS/BTH068
E. Segal, R. Yelensky, D. Koller, Genome-wide discovery of transcriptional modules from DNA sequence and gene expression. Bioinformatics. ,vol. 19, pp. 273- 282 ,(2003) , 10.1093/BIOINFORMATICS/BTG1038