Inferring species trees from gene duplication episodes

作者: J. Gordon Burleigh , Mukul S. Bansal , Oliver Eulenstein , Todd J. Vision

DOI: 10.1145/1854776.1854806

关键词:

摘要: Gene tree parsimony, which infers a species that implies the fewest gene duplications across collection of trees, is method for inferring phylogenetic trees from paralogous genes. However, it assumes all are independent, and therefore, does not account large-scale duplication events like whole genome duplications. We describe two methods to infer based on may involve multiple First, episode parsimony seeks possible episodes. Second, adjusted corrects number at each node in by treating largest as single duplication. test both new methods, well using 7,091 representing 7 plant taxa. perform well, returning after an exhaustive search space. By contrast, fails rank true within top third topologies. Furthermore, with randomly permuted leaf labels can imply fewer episodes than correct labels. Adjusted reflects potentially more realistic and, least small data sets, computationally feasible model counting independently or minimizing

参考文章(37)
Mukul S. Bansal, Oliver Eulenstein, An Ω( n 2 / log n ) speed-up of TBR heuristics for the gene-duplication problem workshop on algorithms in bioinformatics. pp. 124- 135 ,(2007) , 10.1007/978-3-540-74126-8_12
André Wehe, John Gordon Burleigh, H AlMubaid, Scaling the Gene Duplication Problem Towards the Tree of Life. BICoB. pp. 133- 138 ,(2010)
Michael Fellows, Michael Hallett, Ulrike Stege, On the Multiple Gene Duplication Problem international symposium on algorithms and computation. pp. 347- 356 ,(1998) , 10.1007/3-540-49381-6_37
Mukul S. Bansal, J. Gordon Burleigh, Oliver Eulenstein, André Wehe, Heuristics for the gene-duplication problem: a Θ(n) speed-up for the local search research in computational molecular biology. pp. 238- 252 ,(2007) , 10.1007/978-3-540-71681-5_17
Stefanie Hartmann, Todd J Vision, Using ESTs for phylogenomics: can one accurately infer a phylogenetic tree from a gappy alignment? BMC Evolutionary Biology. ,vol. 8, pp. 95- 95 ,(2008) , 10.1186/1471-2148-8-95
James A. Cotton, Roderic D.M. Page, Gene tree parsimony vs uninode coding for phylogenetic reconstruction. Molecular Phylogenetics and Evolution. ,vol. 29, pp. 298- 308 ,(2003) , 10.1016/S1055-7903(03)00109-X
Cheng-Wei Luo, Ming-Chiang Chen, Yi-Ching Chen, Roger W L Yang, Hsiao-Fei Liu, Kun-Mao Chao, Linear-Time Algorithms for the Multiple Gene Duplication Problems IEEE/ACM Transactions on Computational Biology and Bioinformatics. ,vol. 8, pp. 260- 265 ,(2011) , 10.1109/TCBB.2009.52
Roderic D.M. Page, Extracting species trees from complex gene trees: reconciled trees and vertebrate phylogeny. Molecular Phylogenetics and Evolution. ,vol. 14, pp. 89- 106 ,(2000) , 10.1006/MPEV.1999.0676
David T. Jones, William R. Taylor, Janet M. Thornton, The rapid generation of mutation data matrices from protein sequences Bioinformatics. ,vol. 8, pp. 275- 282 ,(1992) , 10.1093/BIOINFORMATICS/8.3.275