Simulating DNA Coding Sequence Evolution with EvolveAGene 3

作者: B. G. Hall

DOI: 10.1093/MOLBEV/MSN008

关键词: Mutation (genetic algorithm)AlgorithmMolecular evolutionSelection (genetic algorithm)Base (topology)Models of DNA evolutionSequenceGeneticsBiologyIndelTransversion

摘要: Phylogenetic reconstruction based upon multiple alignments of molecular sequences is important to most branches modern biology and central evolution. Understanding the historical relationships among macromolecules depends computer programs that implement a variety analytical methods. Because it impossible know those with certainty, assessment accuracy methods them requires use realistically simulate evolution DNA sequences. EvolveAGene 3 realistic coding sequence simulation program separates mutation from selection allows user set conditions, including variable regions intensity within variation in over branches. Variation includes base substitutions, insertions, deletions. To best my knowledge, only available simulates intact Output true tree resulting corresponding protein A log file reports frequencies each kind substitution, ratio transition transversion indel substitution mutations, numbers silent amino acid replacement mutations. The realism data sets has been assessed by comparing d N /d S ratio, mutations simulated parameters real “gold standard” BaliBase collection structural alignments. Results show produced are very similar sets, therefore can be used evaluate

参考文章(23)
Philip A. Burns, Douglas F. Fix, Barry W. Glickman, Mechanisms of spontaneous mutagenesis: clues from altered mutational specificity in DNA repair-defective strains. Basic life sciences. ,vol. 39, pp. 259- 281 ,(1986) , 10.1007/978-1-4684-5182-5_23
P.E. Hartmann, T. Kada, D.M. Shankel, A. Hollaender, Antimutagenesis and Anticarcinogenesis Mechanisms ,(1986)
GPS Raghava, Stephen MJ Searle, Patrick C Audley, Jonathan D Barber, Geoffrey J Barton, None, OXBench: A benchmark for evaluation of protein multiple sequence alignment accuracy BMC Bioinformatics. ,vol. 4, pp. 47- 47 ,(2003) , 10.1186/1471-2105-4-47
Michael S. Rosenberg, MySSP: Non-stationary evolutionary sequence simulation, including indels Evolutionary Bioinformatics. ,vol. 1, pp. 81- 83 ,(2005) , 10.4137/EBO.S0
Timo Lassmann, Erik L.L Sonnhammer, Quality assessment of multiple alignment programs. FEBS Letters. ,vol. 529, pp. 126- 130 ,(2002) , 10.1016/S0014-5793(02)03189-7
D. Hillis, J. Bull, M. White, M. Badgett, I. Molineux, Experimental phylogenetics: generation of a known phylogeny. Science. ,vol. 255, pp. 589- 592 ,(1992) , 10.1126/SCIENCE.1736360
B. G. Hall, Simple and accurate estimation of ancestral protein sequences Proceedings of the National Academy of Sciences of the United States of America. ,vol. 103, pp. 5431- 5436 ,(2006) , 10.1073/PNAS.0508991103
J. Stoye, D. Evers, F. Meyer, Rose: generating sequence families. Bioinformatics. ,vol. 14, pp. 157- 163 ,(1998) , 10.1093/BIOINFORMATICS/14.2.157
Maria Anisimova, Olivier Gascuel, Approximate likelihood-ratio test for branches : A fast, accurate, and powerful alternative Systematic Biology. ,vol. 55, pp. 539- 552 ,(2006) , 10.1080/10635150600755453