GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

作者: Jason S. Cumbie , Jeffrey A. Kimbrel , Yanming Di , Daniel W. Schafer , Larry J. Wilhelm

DOI: 10.1371/JOURNAL.PONE.0025279

关键词: RNA-SeqReference genomeDNA microarrayBiologyGene expression profilingComputational biologyGenomicsGeneRelational databaseGenomeGeneticsGeneral Biochemistry, Genetics and Molecular BiologyGeneral Agricultural and Biological SciencesGeneral Medicine

摘要: GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, applicable prokaryotes and non-model organisms without an available genome reference sequence. For alignments, configured CASHX, Bowtie, BWA, but end user can any Sequence Alignment/Map (SAM)-compliant program preference. To analyze expression, be run with one three statistics packages that are based on variations the negative binomial distribution. The default method new simple statistical test we developed over-parameterized version also includes different methods assessing differentially expressed features enriched ontology (GO) terms. Results transparent systematically stored MySQL relational database facilitate additional analyses as well quality assessment. We used next generation sequencing generate small-scale RNA-Seq dataset derived from heavily studied defense response Arabidopsis thaliana process data. Collectively, support analysis microarrays observed substantial overlap results each demonstrates suited handling unique characteristics small sample sizes high variability counts.

参考文章(54)
Samuel Fox, Sergei Filichkin, Todd C. Mockler, Applications of ultra-high-throughput sequencing. Methods of Molecular Biology. ,vol. 553, pp. 79- 108 ,(2009) , 10.1007/978-1-60327-563-7_5
Yanming Di, Daniel W Schafer, Jason S Cumbie, Jeff H Chang, The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq Statistical Applications in Genetics and Molecular Biology. ,vol. 10, pp. 1- 28 ,(2011) , 10.2202/1544-6115.1637
J. D. Storey, R. Tibshirani, Statistical significance for genomewide studies Proceedings of the National Academy of Sciences of the United States of America. ,vol. 100, pp. 9440- 9445 ,(2003) , 10.1073/PNAS.1530509100
E. Roine, W. Wei, J. Yuan, E.-L. Nurmiaho-Lassila, N. Kalkkinen, M. Romantschuk, S. Y. He, Hrp pilus: An hrp-dependent bacterial surface appendage produced by Pseudomonas syringae pv. tomato DC3000 Proceedings of the National Academy of Sciences of the United States of America. ,vol. 94, pp. 3459- 3464 ,(1997) , 10.1073/PNAS.94.7.3459
Zhong Wang, Mark Gerstein, Michael Snyder, RNA-Seq: a revolutionary tool for transcriptomics Nature Reviews Genetics. ,vol. 10, pp. 57- 63 ,(2009) , 10.1038/NRG2484
Adam Z Rosenthal, Eric G Matson, Avigdor Eldar, Jared R Leadbetter, RNA-seq reveals cooperative metabolic interactions between two termite-gut spirochete species in co-culture The ISME Journal. ,vol. 5, pp. 1133- 1142 ,(2011) , 10.1038/ISMEJ.2011.3
Lin Wang, Raka M Mitra, Keegan D Hasselmann, Masanao Sato, Lisa Lenarz-Wyatt, Jerry D Cohen, Fumiaki Katagiri, Jane Glazebrook, None, The genetic network controlling the Arabidopsis transcriptional response to Pseudomonas syringae pv. maculicola: roles of major regulators and the phytotoxin coronatine. Molecular Plant-microbe Interactions. ,vol. 21, pp. 1408- 1420 ,(2008) , 10.1094/MPMI-21-11-1408
Peter N. Dodds, John P. Rathjen, Plant immunity: towards an integrated view of plant–pathogen interactions Nature Reviews Genetics. ,vol. 11, pp. 539- 548 ,(2010) , 10.1038/NRG2812
Ben Langmead, Kasper D Hansen, Jeffrey T Leek, None, Cloud-scale RNA-sequencing differential expression analysis with Myrna Genome Biology. ,vol. 11, pp. 1- 11 ,(2010) , 10.1186/GB-2010-11-8-R83