作者: Carson Holt , Mark Yandell
关键词:
摘要: Second-generation sequencing technologies are precipitating major shifts with regards to what kinds of genomes being sequenced and how they annotated. While the first generation genome projects focused on well-studied model organisms, many today's involve exotic organisms whose largely terra incognita. This complicates their annotation, because unlike first-generation projects, there no pre-existing 'gold-standard' gene-models which train gene-finders. Improvements in assembly wide availability mRNA-seq data also creating opportunities update re-annotate previously published annotations. Today's thus need new annotation tools that can meet challenges presented by second-generation technologies. We present MAKER2, a management tool designed for projects. MAKER2 is multi-threaded, parallelized application process datasets virtually any size. show produce accurate annotations novel where training-data limited, low quality or even non-existent. provides an easy means use improve quality; it these legacy annotations, significantly improving quality. evaluate identify prioritize problematic manual review. engine specifically scales size, requires little way training data, It manage datasets.