Missing value estimation methods for DNA microarrays.

作者: Troyanskaya Olga , Cantor Michael , Shelock Gavin , Brown Pat , Hastie Trevor

DOI: 10.1093/BIOINFORMATICS/17.6.520

关键词:

摘要: Motivation: Gene expression microarray experiments can generate data sets with multiple missing expression values. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. For example, methods such as hierarchical clustering and K-means clustering are not robust to missing data, and may lose effectiveness even with a few missing values. Methods for imputing missing data are needed, therefore, to minimize the effect of incomplete data sets on analyses, and to …

参考文章(21)
Trevor Hastie, Robert Tibshirani, Michael B Eisen, Ash Alizadeh, Ronald Levy, Louis Staudt, Wing C Chan, David Botstein, Patrick Brown, 'Gene shaving' as a method for identifying distinct sets of genes with similar expression patterns Genome Biology. ,vol. 1, pp. 1- 21 ,(2000) , 10.1186/GB-2000-1-2-RESEARCH0003
Wei-Yin Loh, Nunta Vanichsetakul, Tree-Structured Classification via Generalized Discriminant Analysis Journal of the American Statistical Association. ,vol. 83, pp. 715- 725 ,(1988) , 10.1080/01621459.1988.10478652
Laurie J Heyer, Semyon Kruglyak, Shibu Yooseph, Exploring Expression Data: Identification and Analysis of Coexpressed Genes Genome Research. ,vol. 9, pp. 1106- 1115 ,(1999) , 10.1101/GR.9.11.1106
A. J. BUTTE, J. YE, H. U. HÄRING, M. STUMVOLL, M. F. WHITE, I. S. KOHANE, Determining significant fold differences in gene expression analysis. pacific symposium on biocomputing. pp. 6- 17 ,(2000) , 10.1142/9789814447362_0002
Roderick JA Little, Donald B Rubin, None, Statistical Analysis with Missing Data ,(1987)
Charles M. Perou, Therese Sørlie, Michael B. Eisen, Matt van de Rijn, Stefanie S. Jeffrey, Christian A. Rees, Jonathan R. Pollack, Douglas T. Ross, Hilde Johnsen, Lars A. Akslen, Øystein Fluge, Alexander Pergamenschikov, Cheryl Williams, Shirley X. Zhu, Per E. Lønning, Anne-Lise Børresen-Dale, Patrick O. Brown, David Botstein, Molecular portraits of human breast tumours Nature. ,vol. 406, pp. 747- 752 ,(2000) , 10.1038/35021093
Michael PS Brown, William Noble Grundy, David Lin, Nello Cristianini, Charles Walsh Sugnet, Terrence S Furey, Manuel Ares Jr, David Haussler, None, Knowledge-based analysis of microarray gene expression data by using support vector machines Proceedings of the National Academy of Sciences of the United States of America. ,vol. 97, pp. 262- 267 ,(2000) , 10.1073/PNAS.97.1.262
Paul T Spellman, Gavin Sherlock, Michael Q Zhang, Vishwanath R Iyer, Kirk Anders, Michael B Eisen, Patrick O Brown, David Botstein, Bruce Futcher, None, Comprehensive Identification of Cell Cycle–regulated Genes of the Yeast Saccharomyces cerevisiae by Microarray Hybridization Molecular Biology of the Cell. ,vol. 9, pp. 3273- 3297 ,(1998) , 10.1091/MBC.9.12.3273
T. R. Golub, D. K. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. P. Mesirov, H. Coller, M. L. Loh, J. R. Downing, M. A. Caligiuri, C. D. Bloomfield, E. S. Lander, Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. ,vol. 286, pp. 531- 537 ,(1999) , 10.1126/SCIENCE.286.5439.531