Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome

作者: Jia Qian Wu , Jiang Du , Joel Rozowsky , Zhengdong Zhang , Alexander E Urban

DOI: 10.1186/GB-2008-9-1-R3

关键词: BiologyENCODEDNA microarrayGeneGeneticsHuman genomeNucleic acid amplification techniqueGenomeRapid amplification of cDNA endsTiling array

摘要: Background: Recent studies of the mammalian transcriptome have revealed a large number additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion genome transcribed, exact structures these novel transcripts, levels transcripts produced. Results: We interrogated loci 420 selected ENCyclopedia Of DNA Elements (ENCODE) using rapid amplification cDNA ends (RACE) sequencing. analyzed annotated known gene regions, but primarily we focused on transcriptionally active (TARs), which were previously identified by high-density oligonucleotide tiling arrays random that not believed to be transcribed. found RACE sequencing very sensitive able detect low specific cell types detectable microarrays. also observed many instances sense-antisense transcripts; further analysis suggests antisense (but all) may artifacts generated from reverse transcription reaction. Our results show majority TARs (60%) are connected other or exons. unannotated 17% shown produce overlapping transcripts. Furthermore, it estimated 9% encode proteins. Conclusion: conclude an efficient, sensitive, highly accurate method for characterization cell/tissue types. Using this method, appears represented polyA+ RNA. Moreover, fraction RNAs can protein likely functional.

参考文章(45)
Daniela S Gerhard, L Wagner, EA Feingold, CM Shenmen, LH Grouse, G Schuler, SL Klein, S Old, R Rasooly, P Good, M Guyer, AM Peck, JG Derge, D Lipman, FS Collins, W Jang, S Sherry, M Feolo, L Misquitta, E Lee, K Rotmistrovsky, SF Greenhut, CF Schaefer, K Buetow, TI Bonner, D Haussler, J Kent, M Kiekhaus, T Furey, M Brent, C Prange, K Schreiber, N Shapiro, NK Bhat, RF Hopkins, F Hsie, T Driscoll, MB Soares, TL Casavant, TE Scheetz, Brown-stein MJ, TB Usdin, S Toshiyuki, P Carninci, Y Piao, DB Dudekula, MS Ko, K Kawakami, Y Suzuki, S Sugano, CE Gruber, MR Smith, B Simmons, T Moore, R Waterman, SL Johnson, Y Ruan, CL Wei, S Mathavan, PH Gunaratne, J Wu, AM Garcia, SW Hulyk, E Fuh, Y Yuan, A Sneed, C Kowis, A Hodgson, DM Muzny, J McPherson, RA Gibbs, J Fahey, E Helton, M Ketteman, A Madan, S Rodrigues, A Sanchez, M Whiting, A Madari, AC Young, KD Wetherby, SJ Granite, PN Kwong, CP Brinkley, RL Pearson, GG Bouffard, RW Blakesly, ED Green, MC Dickson, AC Rodriguez, J Grimwood, J Schmutz, RM Myers, YS Butterfield, M Griffith, OL Griffith, MI Krzywinski, N Liao, R Morin, R Morrin, D Palmquist, AS Petrescu, U Skalska, DE Smailus, JM Stott, A Schnerch, JE Schein, SJ Jones, RA Holt, A Baross, MA Marra, S Clifton, KA Makowski, S Bosak, J Malek, The status, quality, and expansion of the NIH full-length cDNA project: The Mammalian Gene Collection (MGC) Genome Research. ,vol. 14, pp. 2121- 2127 ,(2004) , 10.1101/GR.2596504
Jia Qian Wu, Angela M. Garcia, Steven Hulyk, Anna Sneed, Carla Kowis, Ye Yuan, David Steffen, John D. McPherson, Preethi H. Gunaratne, Richard A. Gibbs, Large-scale RT-PCR recovery of full-length cDNA clones. BioTechniques. ,vol. 36, pp. 690- 700 ,(2004) , 10.2144/04364DD03
Petra Ross-Macdonald, Paulo SR Coelho, Terry Roemer, Seema Agarwal, Anuj Kumar, Ronald Jansen, Kei-Hoi Cheung, Amy Sheehan, Dawn Symoniatis, Lara Umansky, Matthew Heidtman, F Kenneth Nelson, Hiroshi Iwasaki, Karl Hager, Mark Gerstein, Perry Miller, G Shirleen Roeder, Michael Snyder, None, Large-scale analysis of the yeast genome by transposon tagging and gene disruption Nature. ,vol. 402, pp. 413- 418 ,(1999) , 10.1038/46558
Michelle L. Hastings, Hema A. Ingle, Mitchell A. Lazar, Stephen H. Munroe, Post-transcriptional Regulation of Thyroid Hormone Receptor Expression by cis-Acting Sequences and a Naturally Occurring Antisense RNA Journal of Biological Chemistry. ,vol. 275, pp. 11507- 11513 ,(2000) , 10.1074/JBC.275.15.11507
Yea Okazaki, M Furuno, T_ Kasukawa, J Adachi, H Bono, Shinji Kondo, I Nikaido, N Osato, Rintaro Saito, H Suzuki, I Yamanaka, H Kiyosawa, K Yagi, Y Tomaru, Y Hasegawa, A Nogami, Christian Schönbach, Takashi Gojobori, R Baldarelli, DP Hill, C Bult, DA Hume, J Quackenbush, LM Schriml, A Kanapin, H Matsuda, S Batalov, KW Beisel, JA Blake, D Bradt, V Brusic, C Chothia, LE Corbani, S Cousins, E Dalla, TA Dragani, CF Fletcher, A Forrest, KS Frazer, T Gaasterland, M Gariboldi, C Gissi, A Godzik, J Gough, S Grimmond, Stefano Gustincich, N Hirokawa, IJ Jackson, ED Jarvis, Akio Kanai, H Kawaji, Y Kawasawa, RM Kedzierski, BL King, A Konagaya, IV Kurochkin, Y Lee, B Lenhard, PA Lyons, DR Maglott, L Maltais, L Marchionni, L McKenzie, H Miki, T Nagashima, K Numata, T Okido, WJ Pavan, G Pertea, G Pesole, N Petrovsky, R Pillai, JU Pontius, D Qi, S Ramachandran, Timothy Ravasi, JC Reed, DJ Reed, J Reid, BZ Ring, M Ringwald, A Sandelin, C Schneider, CAM Semple, M Setou, K Shimada, R Sultana, Y Takenaka, MS Taylor, RD Teasdale, Masaru Tomita, R Verardo, L Wagner, C Wahlestedt, Y Wang, Y Watanabe, C Wells, LG Wilming, A Wynshaw-Boris, Masashi Yanagisawa, I Yang, L Yang, Z Yuan, M Zavolan, Y Zhu, A Zimmer, P Carninci, N Hayatsu, T Hirozane-Kishikawa, H Konno, M Nakamura, N Sakazume, K Sato, T Shiraki, K Waki, J Kawai, K Aizawa, T Arakawa, S Fukuda, A Hara, W Hashizume, K Imotani, Y Ishii, M Itoh, I Kagawa, A Miyazaki, K Sakai, D Sasaki, K Shibata, A Shinagawa, A Yasunishi, M Yoshino, R Waterston, ES Lander, J Rogers, E Birney, Y Hayashizaki, Fantom Consortium, RIKEN Genome Exploration Research Group Phase I & II Team, Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs Nature. ,vol. 420, pp. 563- 573 ,(2002) , 10.1038/NATURE01266
Christine Vanhée-Brossollet, Catherine Vaquero, Do natural antisense transcripts make sense in eukaryotes Gene. ,vol. 211, pp. 1- 9 ,(1998) , 10.1016/S0378-1119(98)00093-6
Warren Gish, David J. States, Identification of protein coding regions by database similarity search. Nature Genetics. ,vol. 3, pp. 266- 272 ,(1993) , 10.1038/NG0393-266
Andrew JP Docherty, Alan Lyons, Bryan J Smith, Edwina M Wright, Paul E Stephens, Tim JR Harris, Gillian Murphy, John J Reynolds, None, Sequence of human tissue inhibitor of metalloproteinases and its identity to erythroid-potentiating activity Nature. ,vol. 318, pp. 66- 69 ,(1985) , 10.1038/318066A0