Custom selected reference genes outperform pre-defined reference genes in transcriptomic analysis

作者: Karen Cristine Gonçalves dos Santos , Isabel Desgagné-Penix , Hugo Germain

DOI: 10.1186/S12864-019-6426-2

关键词: Reference genesDNA sequencingGenome projectFold changeTranscriptomeComputational biologyGene expressionDNA microarrayGeneBiology

摘要: RNA sequencing allows the measuring of gene expression at a resolution unmet by arrays or RT-qPCR. It is however necessary to normalize data library size, transcript size and composition, among other factors, before comparing levels. The use internal control genes spike-ins advocated in literature for scaling read counts, but methods choosing reference are mostly targeted RT-qPCR studies require set pre-selected candidate controls target genes. Here, we report an R-based pipeline select based solely on counts sizes. This novel method first normalizes Transcripts per Million (TPM) then excludes weakly expressed using DAFS script calculate cut-off. selects as references with lowest TPM covariance. We used this pick custom differential analysis three transcriptome sets from transgenic Arabidopsis plants expressing heterologous fungal effector proteins tagged GFP (using alone control). showed lower covariance fold change well broader range levels than commonly When analyzed NormFinder, both typical were considered suitable controls, selected more stably expressed. geNorm produced similar result which most ranked higher (i.e. expressed) proposed innovative, rapid simple. Since it does not depend genome annotation, can be any organism, candidates that always available.

参考文章(38)
Jo Vandesompele, Katleen De Preter, Filip Pattyn, Bruce Poppe, Nadine Van Roy, Anne De Paepe, Frank Speleman, Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes Genome Biology. ,vol. 3, pp. 1- 12 ,(2002) , 10.1186/GB-2002-3-7-RESEARCH0034
Lior Pachter, Models for transcript quantification from RNA-Seq arXiv: Genomics. ,(2011)
Günter P. Wagner, Koryu Kin, Vincent J. Lynch, Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory in Biosciences. ,vol. 131, pp. 281- 285 ,(2012) , 10.1007/S12064-012-0162-3
Jochen B. W. Wolf, Principles of transcriptome analysis and gene expression quantification: an RNA-seq tutorial. Molecular Ecology Resources. ,vol. 13, pp. 559- 572 ,(2013) , 10.1111/1755-0998.12109
Zhong Wang, Mark Gerstein, Michael Snyder, RNA-Seq: a revolutionary tool for transcriptomics Nature Reviews Genetics. ,vol. 10, pp. 57- 63 ,(2009) , 10.1038/NRG2484
U. Nagalakshmi, Z. Wang, K. Waern, C. Shou, D. Raha, M. Gerstein, M. Snyder, The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing Science. ,vol. 320, pp. 1344- 1349 ,(2008) , 10.1126/SCIENCE.1158441
Tao Qing, Ying Yu, TingTing Du, LeMing Shi, mRNA enrichment protocols determine the quantification characteristics of external RNA spike-in controls in RNA-Seq studies Science China Life Sciences. ,vol. 56, pp. 134- 142 ,(2013) , 10.1007/S11427-013-4437-9
Diana E. Jaalouk, Jan Lammerding, Mechanotransduction gone awry. Nature Reviews Molecular Cell Biology. ,vol. 10, pp. 63- 73 ,(2009) , 10.1038/NRM2597
L. Christiaen, B. Davidson, T. Kawashima, W. Powell, H. Nolla, K. Vranizan, M. Levine, The Transcription/Migration Interface in Heart Precursors of Ciona intestinalis Science. ,vol. 320, pp. 1349- 1352 ,(2008) , 10.1126/SCIENCE.1158170
Traver Hart, H Komori, Sarah LaMere, Katie Podshivalova, Daniel R Salomon, Finding the active genes in deep RNA-seq gene expression studies. BMC Genomics. ,vol. 14, pp. 778- 778 ,(2013) , 10.1186/1471-2164-14-778