Steady progress and recent breakthroughs in the accuracy of automated genome annotation

作者: Michael R. Brent

DOI: 10.1038/NRG2220

关键词:

摘要: The sequencing of large, complex genomes has become routine, but understanding how sequences relate to biological function is less straightforward. Although much attention focused on annotate genomic features such as developmental enhancers and non-coding RNAs, there still no higher eukaryote for which we know the correct exon-intron structure at least one ORF each gene. Despite this uncomfortable truth, genome annotation made remarkable progress since first drafts human were analysed. By combining several computational experimental methods, are now closer producing complete accurate gene catalogues than ever before.

参考文章(61)
Daniela S Gerhard, L Wagner, EA Feingold, CM Shenmen, LH Grouse, G Schuler, SL Klein, S Old, R Rasooly, P Good, M Guyer, AM Peck, JG Derge, D Lipman, FS Collins, W Jang, S Sherry, M Feolo, L Misquitta, E Lee, K Rotmistrovsky, SF Greenhut, CF Schaefer, K Buetow, TI Bonner, D Haussler, J Kent, M Kiekhaus, T Furey, M Brent, C Prange, K Schreiber, N Shapiro, NK Bhat, RF Hopkins, F Hsie, T Driscoll, MB Soares, TL Casavant, TE Scheetz, Brown-stein MJ, TB Usdin, S Toshiyuki, P Carninci, Y Piao, DB Dudekula, MS Ko, K Kawakami, Y Suzuki, S Sugano, CE Gruber, MR Smith, B Simmons, T Moore, R Waterman, SL Johnson, Y Ruan, CL Wei, S Mathavan, PH Gunaratne, J Wu, AM Garcia, SW Hulyk, E Fuh, Y Yuan, A Sneed, C Kowis, A Hodgson, DM Muzny, J McPherson, RA Gibbs, J Fahey, E Helton, M Ketteman, A Madan, S Rodrigues, A Sanchez, M Whiting, A Madari, AC Young, KD Wetherby, SJ Granite, PN Kwong, CP Brinkley, RL Pearson, GG Bouffard, RW Blakesly, ED Green, MC Dickson, AC Rodriguez, J Grimwood, J Schmutz, RM Myers, YS Butterfield, M Griffith, OL Griffith, MI Krzywinski, N Liao, R Morin, R Morrin, D Palmquist, AS Petrescu, U Skalska, DE Smailus, JM Stott, A Schnerch, JE Schein, SJ Jones, RA Holt, A Baross, MA Marra, S Clifton, KA Makowski, S Bosak, J Malek, The status, quality, and expansion of the NIH full-length cDNA project: The Mammalian Gene Collection (MGC) Genome Research. ,vol. 14, pp. 2121- 2127 ,(2004) , 10.1101/GR.2596504
Y. Shibata, P. Carninci, A. Watahiki, T. Shiraki, H. Konno, M. Muramatsu, Y. Hayashizaki, Cloning full-length, cap-trapper-selected cDNAs by using the single-strand linker ligation method. BioTechniques. ,vol. 30, pp. 1250- 1254 ,(2001) , 10.2144/01306ST01
Mario Stanke, Oliver Schöffmann, Burkhard Morgenstern, Stephan Waack, Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics. ,vol. 7, pp. 62- 62 ,(2006) , 10.1186/1471-2105-7-62
William A Moskal, Hank C Wu, Beverly A Underwood, Wei Wang, Christopher D Town, Yongli Xiao, Experimental validation of novel genes predicted in the un-annotated regions of the Arabidopsis genome BMC Genomics. ,vol. 8, pp. 18- 18 ,(2007) , 10.1186/1471-2164-8-18
Eduardo Eyras, Alexandre Reymond, Robert Castelo, Jacqueline M Bye, Francisco Camara, Paul Flicek, Elizabeth J Huckle, Genis Parra, David D Shteynberg, Carine Wyss, Jane Rogers, Stylianos E Antonarakis, Ewan Birney, Roderic Guigo, Michael R Brent, Gene finding in the chicken genome BMC Bioinformatics. ,vol. 6, pp. 131- 131 ,(2005) , 10.1186/1471-2105-6-131
Guy Slater, Ewan Birney, Automated generation of heuristics for biological sequence comparison BMC Bioinformatics. ,vol. 6, pp. 31- 31 ,(2005) , 10.1186/1471-2105-6-31
Asaf A Salamov, Victor V Solovyev, Ab initio Gene Finding in Drosophila Genomic DNA Genome Research. ,vol. 10, pp. 516- 522 ,(2000) , 10.1101/GR.10.4.516
M. Clamp, B. Fry, M. Kamal, X. Xie, J. Cuff, M. F. Lin, M. Kellis, K. Lindblad-Toh, E. S. Lander, Distinguishing protein-coding and noncoding genes in the human genome Proceedings of the National Academy of Sciences of the United States of America. ,vol. 104, pp. 19428- 19433 ,(2007) , 10.1073/PNAS.0709013104
M. R. Brent, Genome annotation past, present, and future: How to define an ORF at each locus Genome Research. ,vol. 15, pp. 1777- 1786 ,(2005) , 10.1101/GR.3866105