MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

Authors: Carson Holt, Mark Yandell

DOI: 10.1186/1471-2105-12-491

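Abstract: Second-generation sequencing technologies are precipitating major shifts with regard to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation because, unlike first-generation projects, there are no pre-existing 'gold-standard' gene models with which to train gene finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies. We present MAKER2, a genome annotation and data management tool designed for second-generation genome projects. MAKER2 is a multi-threaded, parallelized application that can process second-generation datasets of virtually any size. We show that MAKER2 can produce accurate annotations for novel genomes where training data are limited, of low quality, or even non-existent. MAKER2 also provides an easy means to use mRNA-seq data to improve annotation quality; it can use these data to update legacy annotations, significantly improving their quality. We also show that MAKER2 can evaluate the quality of genome annotations, and identify and prioritize problematic annotations for manual review. MAKER2 is the first annotation engine specifically designed for second-generation genome projects. It scales to datasets of any size, requires little in the way of training data, and can use mRNA-seq data to improve annotation quality. It can also update and manage legacy genome annotation datasets.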
