作者: August Guang , Mark Howison , Felipe Zapata , Charles Lawrence , Casey Dunn
DOI: 10.1101/202416
关键词: Genetics 、 Phylogenetic tree 、 Transcriptome 、 Gene 、 Biology 、 Computational biology 、 Clade
摘要: One of the most common transcriptome assembly errors is to mistake different transcripts same gene as from multiple closely related genes. It difficult identify these during assembly, but in a phylogenetic analysis can be diagnosed trees containing clades tips species with improbably short branch lengths. treeinform module implemented Agalma1.0 that uses analyses across refine assemblies. identifies were incorrectly assigned genes and reassign them gene. Agalma1.0, available at https://bitbucket.org/caseywdunn/agalma. Supplementary information bioRxiv.