Comparison of High-Level Microarray Analysis Methods in the Context of Result Consistency

作者: Kornel Chrominski , Magdalena Tkacz

DOI: 10.1371/JOURNAL.PONE.0128845

关键词: BioinformaticsMicroarray analysis techniquesDNA microarrayContext (language use)GeneSet (abstract data type)Gene expressionGene chip analysisData miningRank (computer programming)Computer scienceMicroarrayGeneral Biochemistry, Genetics and Molecular BiologyGeneral Agricultural and Biological SciencesGeneral Medicine

摘要: Motivation When we were asked for help with high-level microarray data analysis (on Affymetrix HGU-133A microarray), faced the problem of selecting an appropriate method. We wanted to select a method that would yield "the best result" (detected as many "really" differentially expressed genes (DEGs) possible, without false positives and negatives). However, life scientists could not us – they use their "favorite" special argumentation. also did find any norm or recommendation. Therefore, decided examine it our own purpose. considered whether results obtained using different methods analyses Significant Analysis Microarrays, Rank Products, Bland-Altman, Mann-Whitney test, T test Linear Models Microarray Data be in agreement. Initially, conducted comparative on eight real sets from experiments (from Array Express database). The surprising. On same array set, set DEGs by significantly different. applied artificial determined some measures allow preparation overall scoring tested future recommendation. Results We found very low level concordance sets. number common all six fixed sets, checked sets) ranged 6 433 (22,283 total readings). Results better than those data. fully satisfying. scored accuracy, recall, precision, f-measure Matthews correlation coefficient. Based scoring, SAM LIMMA. TT acceptable. worst was MW. study, recommend: 1. Carefully taking into account need study when choosing method, 2. Making more one then only are (which seems reasonable) 3. Being careful (while summarizing facts) about genes: discover DEGs.

参考文章(22)
Robert Gentleman, Vincent J Carey, Wolfgang Huber, Rafael A Irizarry, Sandrine Dudoit, Bioinformatics and Computational Biology Solutions Using R and Bioconductor ,(2006)
G. K. Smyth, limma: Linear Models for Microarray Data Springer, New York, NY. pp. 397- 420 ,(2005) , 10.1007/0-387-29362-0_23
Michael C O'Neill, Li Song, Neural network analysis of lymphoma microarray data: prognosis and diagnosis near-perfect BMC Bioinformatics. ,vol. 4, pp. 13- 13 ,(2003) , 10.1186/1471-2105-4-13
Ola Larsson, Claes Wahlestedt, James A Timmons, Considerations when using the significance analysis of microarrays (SAM) algorithm BMC Bioinformatics. ,vol. 6, pp. 129- 129 ,(2005) , 10.1186/1471-2105-6-129
J. D. Storey, R. Tibshirani, Statistical significance for genomewide studies Proceedings of the National Academy of Sciences of the United States of America. ,vol. 100, pp. 9440- 9445 ,(2003) , 10.1073/PNAS.1530509100
Bryan AP Roxas, Qingbo Li, Significance analysis of microarray for relative quantitation of LC/MS data in proteomics BMC Bioinformatics. ,vol. 9, pp. 187- 187 ,(2008) , 10.1186/1471-2105-9-187
Giuseppe Jurman, Samantha Riccadonna, Cesare Furlanello, A Comparison of MCC and CEN Error Measures in Multi-Class Prediction PLoS ONE. ,vol. 7, pp. e41882- ,(2012) , 10.1371/JOURNAL.PONE.0041882
J. Martin Bland, DouglasG. Altman, Statistical methods for assessing agreement between two methods of clinical measurement. The Lancet. ,vol. 327, pp. 307- 310 ,(1986) , 10.1016/S0140-6736(86)90837-8
Rainer Breitling, Patrick Armengaud, Anna Amtmann, Pawel Herzyk, Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments ☆ FEBS Letters. ,vol. 573, pp. 83- 92 ,(2004) , 10.1016/J.FEBSLET.2004.07.055
Marine Jeanmougin, Aurelien de Reynies, Laetitia Marisa, Caroline Paccard, Gregory Nuel, Mickael Guedj, Should We Abandon the t-Test in the Analysis of Gene Expression Microarray Data: A Comparison of Variance Modeling Strategies PLoS ONE. ,vol. 5, pp. e12336- ,(2010) , 10.1371/JOURNAL.PONE.0012336