Evaluation of read count based RNAseq analysis methods

Authors: Yan Guo, Chung-I Li, Fei Ye, Yu Shyr

DOI: 10.1186/1471-2164-14-S8-S2

Abstract: RNAseq technology is replacing microarray as the tool of choice for gene expression profiling. While RNAseq provides much richer data than microarray, its analysis has been more challenging. To date, there is not a consensus on the best approach for conducting robust analysis. In this study, we designed a thorough experiment to evaluate six read count-based methods (DESeq, DEGseq, edgeR, NBPSeq, TSPM and baySeq) using both real and simulated data. We found that the six methods produce similar fold changes and reasonably overlapping sets of differentially expressed genes based on p-values. However, all six methods suffer from over-sensitivity. Based on our evaluation of runtime and of area under the receiver operating characteristic curve (AUC-ROC) on these data, we found that edgeR achieves a better balance between speed and accuracy than the other methods.

References (14)
Paul L. Auer, Rebecca W Doerge. A Two-Stage Poisson Model for Testing RNA-Seq Data. Statistical Applications in Genetics and Molecular Biology, vol. 10, pp. 1-26 (2011). DOI: 10.2202/1544-6115.1627
Yanming Di, Daniel W Schafer, Jason S Cumbie, Jeff H Chang. The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq. Statistical Applications in Genetics and Molecular Biology, vol. 10, pp. 1-28 (2011). DOI: 10.2202/1544-6115.1637
Zhong Wang, Mark Gerstein, Michael Snyder. RNA-Seq: a revolutionary tool for transcriptomics. Nature Reviews Genetics, vol. 10, pp. 57-63 (2009). DOI: 10.1038/NRG2484
Jay Shendure. The Beginning of the End for Microarrays. Nature Methods, vol. 5, pp. 585-587 (2008). DOI: 10.1038/NMETH0708-585
Thomas J Hardcastle, Krystyna A Kelly. baySeq: Empirical Bayesian methods for identifying differential expression in sequence count data. BMC Bioinformatics, vol. 11, p. 422 (2010). DOI: 10.1186/1471-2105-11-422
Yan Guo, Quanhu Sheng, Jiang Li, Fei Ye, David C. Samuels, Yu Shyr. Large Scale Comparison of Gene Expression Levels by Microarrays and RNAseq Using TCGA Data. PLoS ONE, vol. 8, e71462 (2013). DOI: 10.1371/JOURNAL.PONE.0071462
Mark D Robinson, Davis J McCarthy, Gordon K Smyth. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, vol. 26, pp. 139-140 (2010). DOI: 10.1093/BIOINFORMATICS/BTP616
Likun Wang, Zhixing Feng, Xi Wang, Xiaowo Wang, Xuegong Zhang. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics, vol. 26, pp. 136-138 (2010). DOI: 10.1093/BIOINFORMATICS/BTP612
Cole Trapnell, Brian A Williams, Geo Pertea, Ali Mortazavi, Gordon Kwan, Marijke J van Baren, Steven L Salzberg, Barbara J Wold, Lior Pachter. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature Biotechnology, vol. 28, pp. 511-515 (2010). DOI: 10.1038/NBT.1621