Cluster inference methods and graphical models evaluated on NCI60 microarray gene expression data.

作者: Hirohisa Kishino , Peter J. Waddell

DOI: 10.11234/GI1990.11.129

关键词: Tree (data structure)Graphical modelSampling distributionConditional probability distributionInferenceData miningBiologyPartial correlationCluster analysisLatent variable

摘要: At present, there is a lack of sound methodology to infer causal gene expression relationships on genome wide basis. We address this first by examining the behaviour some latest and fastest algorithms for tree cluster analysis, particularly hierarchical methods popular in phylogenetics. Combined with these are two novel distances based partial, rather than full, correlations. Theoretically, partial correlations should provide better evidence regulatory genetic links standard To compare clusters obtained many alternative we use consensus methods. analysis used partition metrics followed another level clustering. These, fit metric, all suggest that new give quite different trees those usually obtained. In second part consider graphical modeling interactions important genes cell cycle. Despite models seeming well occasions, despite experimental error structure close multivariate normal, considerable problems overcome. Latent variables, case missing from inferred have strong effect Also, data show clear sampling distributions conditional status cancer related genes, including TP53. Without full information which wild type appropriate cannot be fitted. These findings point need include distinguish not only relevant but also splice variants design phase microarray analysis. Failure do so will induce similar both latent variables distributions.

参考文章(17)
Manor Askenazi, Xiling Wen, Daniel B. Carr, Roland Somogyi, Stefanie Fuhrman, George S. Michaels, Cluster analysis and data visualization of large-scale gene expression data. pacific symposium on biocomputing. pp. 42- 53 ,(1998)
Dominic A. Scudiero, Kurt W. Kohn, John N. Weinstein, Stephen Friend, Joany Jackman, Albert J. Fornace, Timothy G. Myers, Masato Mutoh, Edward A. Sausville, Saijun Fan, Ann Monks, Patrick M. O'Connor, Insoo Bae, Characterization of the p53 Tumor Suppressor Pathway in Cell Lines of the National Cancer Institute Anticancer Drug Screen and Correlations with the Growth-Inhibitory Potency of 123 Anticancer Agents Cancer Research. ,vol. 57, pp. 4285- 4300 ,(1997)
Douglas T Ross, Uwe Scherf, Michael B Eisen, Charles M Perou, Christian Rees, Paul Spellman, Vishwanath Iyer, Stefanie S Jeffrey, Matt Van de Rijn, Mark Waltham, Alexander Pergamenschikov, JC Lee, Deval Lashkari, Dari Shalon, Timothy G Myers, John N Weinstein, David Botstein, Patrick O Brown, None, Systematic variation in gene expression patterns in human cancer cell lines. Nature Genetics. ,vol. 24, pp. 227- 235 ,(2000) , 10.1038/73432
Sergei P. Atamas, Alternative splice variants of cytokines: Making a list Life Sciences. ,vol. 61, pp. 1105- 1112 ,(1997) , 10.1016/S0024-3205(97)00243-9
Spyro Mousses, Hilmi Özçelik, Peter D.Lee, David Malkin, Shelley B.Bull, Irene L.Andrulis, Two variants of the CIP1/WAF1 gene occur together and are associated with human cancer Human Molecular Genetics. ,vol. 4, pp. 1089- 1092 ,(1995) , 10.1093/HMG/4.6.1089
Uwe Scherf, Douglas T. Ross, Mark Waltham, Lawrence H. Smith, Jae K. Lee, Lorraine Tanabe, Kurt W. Kohn, William C. Reinhold, Timothy G. Myers, Darren T. Andrews, Dominic A. Scudiero, Michael B. Eisen, Edward A. Sausville, Yves Pommier, David Botstein, Patrick O. Brown, John N. Weinstein, A gene expression database for the molecular pharmacology of cancer Nature Genetics. ,vol. 24, pp. 236- 244 ,(2000) , 10.1038/73439
J. Yu, L. Zhang, P. M. Hwang, C. Rago, K. W. Kinzler, B. Vogelstein, Identification and classification of p53-regulated genes Proceedings of the National Academy of Sciences of the United States of America. ,vol. 96, pp. 14517- 14522 ,(1999) , 10.1073/PNAS.96.25.14517
D.F. Robinson, L.R. Foulds, Comparison of phylogenetic trees Mathematical Biosciences. ,vol. 53, pp. 131- 147 ,(1981) , 10.1016/0025-5564(81)90043-2
Nir Friedman, Michal Linial, Iftach Nachman, Dana Pe'er, Using Bayesian networks to analyze expression data research in computational molecular biology. pp. 127- 135 ,(2000) , 10.1145/332306.332355