Controlling the number of false discoveries: application to high-dimensional genomic data

作者: Edward L Korn , James F Troendle , Lisa M McShane , Richard Simon

DOI: 10.1016/S0378-3758(03)00211-8

关键词:

摘要: Abstract Researchers conducting gene expression microarray experiments often are interested in identifying genes that differentially expressed between two groups of specimens. A straightforward approach to the identification such “differentially expressed” is perform a univariate analysis group mean differences for each gene, and then identify those most statistically significant. However, with large number typically represented on microarray, using nominal significance levels (unadjusted multiple comparisons) will lead many truly not expressed, “false discoveries.” reasonable strategy situations allow small false discoveries, or proportion identified be discoveries. Although previous work has considered control expected discoveries (commonly known as discovery rate), we show these methods may inadequate. We propose stepwise permutation-based procedures specified confidence actual approximately Limited simulation studies demonstrate substantial gain sensitivity detect even when allowing few one apply new analyze data set consisting measurements 9000 paired tumor specimens, collected both before after chemotherapy 20 breast cancer patients. The described broadly applicable problem which variables any measured differ pre-specified groups.

参考文章(12)
S. Ichikawa, Responses to Ionizing Radiation Springer Berlin Heidelberg. pp. 199- 228 ,(1981) , 10.1007/978-3-642-68090-8_8
M. Schena, D. Shalon, R. W. Davis, P. O. Brown, Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray Science. ,vol. 270, pp. 467- 470 ,(1995) , 10.1126/SCIENCE.270.5235.467
James G. Booth, P. H. Westfall, S. S. Young, Resampling-Based Multiple Testing. Journal of the American Statistical Association. ,vol. 89, pp. 354- ,(1994) , 10.2307/2291234
Paul Seeger, A Note on a Method for the Analysis of Significances en masse Technometrics. ,vol. 10, pp. 586- 593 ,(1968) , 10.1080/00401706.1968.10490605
Yee H. Yang, Sandrine Dudoit, Percy Luu, Terence P. Speed, Normalization for cDNA microarry data Microarrays : optical technologies and informatics. Conference. ,vol. 4266, pp. 141- 152 ,(2001) , 10.1117/12.427982
Charles M. Perou, Therese Sørlie, Michael B. Eisen, Matt van de Rijn, Stefanie S. Jeffrey, Christian A. Rees, Jonathan R. Pollack, Douglas T. Ross, Hilde Johnsen, Lars A. Akslen, Øystein Fluge, Alexander Pergamenschikov, Cheryl Williams, Shirley X. Zhu, Per E. Lønning, Anne-Lise Børresen-Dale, Patrick O. Brown, David Botstein, Molecular portraits of human breast tumours Nature. ,vol. 406, pp. 747- 752 ,(2000) , 10.1038/35021093
Yoav Benjamini, Yosef Hochberg, Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 57, pp. 289- 300 ,(1995) , 10.1111/J.2517-6161.1995.TB02031.X
Matthew J Callow, Sandrine Dudoit, Elaine L Gong, Terence P Speed, Edward M Rubin, Microarray Expression Profiling Identifies Genes with Altered Expression in HDL-Deficient Mice Genome Research. ,vol. 10, pp. 2022- 2029 ,(2000) , 10.1101/GR.10.12.2022
V. G. Tusher, R. Tibshirani, G. Chu, Significance analysis of microarrays applied to the ionizing radiation response Proceedings of the National Academy of Sciences of the United States of America. ,vol. 98, pp. 5116- 5121 ,(2001) , 10.1073/PNAS.091062498
Yidong Chen, Ratio-based decisions and the quantitative analysis of cDNA micro-array images Journal of Biomedical Optics. ,vol. 2, pp. 364- 374 ,(1999) , 10.1117/12.281504