Determining the evolutionary history of gene families

作者: Ryan M. Ames , Daniel Money , Vikramsinh P. Ghatge , Simon Whelan , Simon C. Lovell

DOI: 10.1093/BIOINFORMATICS/BTR592

关键词: Ancestral reconstructionCopy-number variationPhylogenetic treeStatisticsPopulationBiologyMarkov modelMarkov chainGene familyVariation (game tree)

摘要: Motivation: Recent large-scale studies of individuals within a population have demonstrated that there is widespread variation in copy number many gene families. In addition, increasing evidence the can give rise to substantial phenotypic effects. some cases, these variations been shown be adaptive. These observations show full understanding evolution biological function requires an gain and loss. Accurate, robust evolutionary models loss events are, therefore, required. Results: We developed weighted parsimony maximum likelihood methods for inferring events. To test methods, we used Markov simulate data with known properties. examine three models: simple birth–death model, single rate model innovation parameters estimated from Drosophila genome data. find all simulations likelihood-based are very accurate reconstructing duplication on phylogenetic tree, similar accuracy ancestral state. Our implementations different provide inferences states For reconstruction, recommend because it has likelihood, but much faster. individual or events, noticeably more accurate, albeit at greater computational cost. Availability: www.bioinf.manchester.ac.uk/dupliphy Contact:simon.lovell@manchester.ac.uk; simon.whelan@manchester.ac.uk Supplementary information:Supplementary available Bioinformatics online.

参考文章(53)
Daniel A Pollard, Venky N Iyer, Alan M Moses, Michael B Eisen, Widespread Discordance of Gene Trees with Species Tree in Drosophila: Evidence for Incomplete Lineage Sorting PLOS Genetics. ,vol. 2, ,(2006) , 10.1371/JOURNAL.PGEN.0020173
Michael Lynch, John S Conery, The Evolutionary Fate and Consequences of Duplicate Genes Science. ,vol. 290, pp. 1151- 1155 ,(2000) , 10.1126/SCIENCE.290.5494.1151
Shalini Nair, Becky Miller, Marion Barends, Anchalee Jaidee, Jigar Patel, Mayfong Mayxay, Paul Newton, François Nosten, Michael T. Ferdig, Tim J. C. Anderson, Adaptive Copy Number Evolution in Malaria Parasites PLoS Genetics. ,vol. 4, pp. e1000243- ,(2008) , 10.1371/JOURNAL.PGEN.1000243
Tal Pupko, Itsik Pe, Ron Shamir, Dan Graur, A Fast Algorithm for Joint Reconstruction of Ancestral Amino Acid Sequences Molecular Biology and Evolution. ,vol. 17, pp. 890- 896 ,(2000) , 10.1093/OXFORDJOURNALS.MOLBEV.A026369
Daniel J Turner, Marcos Miretti, Diana Rajan, Heike Fiegler, Nigel P Carter, Martyn L Blayney, Stephan Beck, Matthew E Hurles, Germline rates of de novo meiotic deletions and duplications causing several genomic disorders Nature Genetics. ,vol. 40, pp. 90- 95 ,(2008) , 10.1038/NG.2007.40
Gavin C Conant, Andreas Wagner, GenomeHistory: a software tool and its application to fully sequenced genomes Nucleic Acids Research. ,vol. 30, pp. 3378- 3386 ,(2002) , 10.1093/NAR/GKF449
S Altschula, Warren Gisha, Webb Millerb, E Meyersc, D Lipmana, None, Basic Local Alignment Search Tool Journal of Molecular Biology. ,vol. 215, pp. 403- 410 ,(1990) , 10.1016/S0022-2836(05)80360-2
Chris M Egan, Srinath Sridhar, Michael Wigler, Ira M Hall, Recurrent DNA copy number variation in the laboratory mouse. Nature Genetics. ,vol. 39, pp. 1384- 1389 ,(2007) , 10.1038/NG.2007.19
Kevin Chen, Dannie Durand, Martin Farach-Colton, NOTUNG: a program for dating gene duplications and optimizing gene family trees. research in computational molecular biology. ,vol. 7, pp. 429- 447 ,(2000) , 10.1089/106652700750050871
David Sankoff, Minimal Mutation Trees of Sequences Siam Journal on Applied Mathematics. ,vol. 28, pp. 35- 42 ,(1975) , 10.1137/0128004