Phylogenomics from Whole Genome Sequences Using aTRAM.

作者: Julie M Allen , Bret Boyd , Nam-Phuong Nguyen , Pranjal Vachaspati , Tandy Warnow

DOI: 10.1093/SYSBIO/SYW105

关键词: GenomicsBiologyComputational biologyPhylogenomicsGenomePhylogenetic treeSequence analysisWhole genome sequencingDNA sequencingContig

摘要: Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently most commonly used phylogenomic approaches involve some form genome reduction. While these make assembling more economical for organisms with large genomes, they reduce genomic coverage and thereby long-term utility data. Currently, moderate small genomes ($<$1000 Mbp) it is feasible sequence entire at modest ($10-30\times$). Computational challenges handling alleviated by targeted reads, rather than genome, produce a matrix. Here we demonstrate use automated Target Restricted Assembly Method (aTRAM) assemble 1107 single-copy ortholog genes from whole sucking lice (Anoplura) out-groups. We developed pipeline extract exon sequences aTRAM assemblies annotating them respect original target protein. aligned protein inferred amino acids then performed analyses on both concatenated matrix each gene separately in coalescent analysis. Finally, tested limits successful assembly 100 close- distantly related taxa high low levels coverage.Both analysis coalescent-based produced same tree topology, which was consistent previously published results resolved weakly supported nodes. These this approach developing raw reads. Further, found coverages above $5-10\times$, 80-90% contigs close taxa. As costs continue decline, expect full will become wider array organisms, enable mining an extensive variety applications, including phylogenomics. [aTRAM; assembly; sequencing; phylogenomics.].

参考文章(46)
Robert Evans Snodgrass, The feeding apparatus of biting and sucking insects affecting man and animals Miscellaneous Publications. ,vol. 104, ,(1944)
Nam-phuong D. Nguyen, Siavash Mirarab, Keerthana Kumar, Tandy Warnow, Ultra-large alignments using phylogeny-aware profiles Genome Biology. ,vol. 16, pp. 124- 124 ,(2015) , 10.1186/S13059-015-0688-Z
Guy Slater, Ewan Birney, Automated generation of heuristics for biological sequence comparison BMC Bioinformatics. ,vol. 6, pp. 31- 31 ,(2005) , 10.1186/1471-2105-6-31
Richard O. Prum, Jacob S. Berv, Alex Dornburg, Daniel J. Field, Jeffrey P. Townsend, Emily Moriarty Lemmon, Alan R. Lemmon, A comprehensive phylogeny of birds (Aves) using targeted next-generation DNA sequencing Nature. ,vol. 526, pp. 569- 573 ,(2015) , 10.1038/NATURE15697
Daniel R. Zerbino, Using the Velvet de novo Assembler for Short-Read Sequencing Technologies Current protocols in human genetics. ,vol. 31, ,(2010) , 10.1002/0471250953.BI1105S31
Jean-Christophe Camus, Melinda J. Pryor, Claudine Médigue, Stewart T. Cole, Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv Microbiology. ,vol. 148, pp. 2967- 2973 ,(2002) , 10.1099/00221287-148-10-2967
Mark Howison, Felipe Zapata, Erika J. Edwards, Casey W. Dunn, Bayesian Genome Assembly and Assessment by Markov Chain Monte Carlo Sampling PLoS ONE. ,vol. 9, pp. e99497- ,(2014) , 10.1371/JOURNAL.PONE.0099497
J. T. Simpson, K. Wong, S. D. Jackman, J. E. Schein, S. J.M. Jones, I. Birol, ABySS: A parallel assembler for short read sequence data Genome Research. ,vol. 19, pp. 1117- 1123 ,(2009) , 10.1101/GR.089532.108
Roger S Lasken, Timothy B Stockwell, Mechanism of chimera formation during the Multiple Displacement Amplification reaction BMC Biotechnology. ,vol. 7, pp. 19- 19 ,(2007) , 10.1186/1472-6750-7-19