The effect of alignment uncertainty, substitution models and priors in building and dating the mammal tree of life.

Authors: Yan Du, Shaoyuan Wu, Scott V. Edwards, Liang Liu

DOI: 10.1186/S12862-019-1534-9

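Abstract: The flood of genomic data to help build and date the tree of life requires automation at several critical junctures, most importantly during sequence assembly and alignment. It is widely appreciated that automated alignment protocols can yield inaccuracies, but the relative impact of various sources of error on phylogenomic analysis is not yet known. This study employs an updated mammal data set of 5162 coding loci sampled from 90 species to evaluate the effects of alignment uncertainty, substitution models, and fossil priors on gene tree, species tree, and divergence time estimation. Additionally, a novel coalescent likelihood ratio test is introduced for comparing competing species trees against a given set of gene trees. The aligned DNA sequences of 5162 loci from 90 species were trimmed and filtered using trimAL and two filtering protocols. The final dataset contains 4 sets of alignments: before trimming, after trimming, filtered by a recently proposed pipeline, and further filtered by comparing ML gene trees for each locus with the concatenation tree. Our analyses suggest that the average discordance among the coalescent trees is significantly smaller than that among the concatenation trees estimated from the 4 sets of alignments or with different substitution models. There is no significant difference among the divergence times estimated with different substitution models. However, the divergence dates estimated from the alignments after trimming are more recent than those estimated from the alignments before trimming. Our results highlight that alignment uncertainty of the updated mammal data set and the choice of substitution models have little impact on tree topologies yielded by coalescent methods for species tree estimation, whereas they are more influential on the trees made by concatenation. Given the choice of calibration scheme and clock models, divergence time estimates are robust to the choice of substitution models, but removing alignments deemed problematic by trimming algorithms can lead to more recent dates. Although the fossil prior is important in Bayesian divergence time estimation, Bayesian estimates of divergence times in this study are driven primarily by the sequence data.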
