作者: Estelle Proux-Wéra , David Armisén , Kevin P Byrne , Kenneth H Wolfe
关键词:
摘要: Background: Yeasts are a model system for exploring eukaryotic genome evolution. Next-generation sequencing technologies poised to vastly increase the number of yeast sequences, both from resequencing projects (population studies) and de novo (new species). However, annotation genomes presents major bottleneck projects, because it still relies on process that is largely manual. Results: Here we present Yeast Genome Annotation Pipeline (YGAP), an automated designed specifically new sequences lacking transcriptome data. YGAP does automatic annotation, exploiting homology synteny information other species stored in Gene Order Browser (YGOB) database. The basic premises underlying YGAP's approach data already tells us what genes should expect find any particular genomic region also orthologous likely have similar intron/exon structures. Additionally, able detect probable frameshift errors can propose corrections them. searches intelligently introns, detects tRNA Ty-like elements. Conclusions: In tests Saccharomyces cerevisiae Naumovozyma castellii Tetrapisispora blattae newly sequenced with Roche-454 technology, outperformed another popular program (AUGUSTUS). For S. N.castellii, 91-93% predicted gene structures were identical those previous manually curated sets. has been implemented as webserver user-friendly interface at http://wolfe.gen.tcd.ie/annotation.