Combining phylogenetic and hidden Markov models in biosequence analysis

作者: Adam Siepel , David Haussler

DOI: 10.1145/640075.640111

关键词:

摘要: A few models have appeared in recent years that consider not only the way substitutions occur through evolutionary history at each site of a genome, but also process changes from one to next. These combine phylogenetic molecular evolution, which apply individual sites, and hidden Markov models, allow for site. Besides improving realism ordinary they are potentially very powerful tools inference prediction---for gene finding, example, or prediction secondary structure. In this paper, we review progress on combined present some extensions previous work. Our main result is simple efficient method accommodating higher-order states HMM, allows context-sensitive substitution---that is, effects neighboring bases pattern substitution. We experimental results indicating states, autocorrelated rates, multiple functional categories all lead significant improvements fit model, with effect being particularly pronounced.

参考文章(59)
David Haussler, Martin G. Reese, David Kulp, Frank H. Eeckman, A Generalized Hidden Markov Model for the Recognition of Human Genes in DNA intelligent systems in molecular biology. ,vol. 4, pp. 134- 142 ,(1996)
Elizabeth J. B. Williams, Laurence D. Hurst, The proteins of linked genes evolve at similar rates. Nature. ,vol. 407, pp. 900- 903 ,(2000) , 10.1038/35038066
Yair Weiss, Kevin P. Murphy, Michael I. Jordan, Loopy belief propagation for approximate inference: an empirical study uncertainty in artificial intelligence. pp. 467- 475 ,(1999)
Jonathan S. Yedidia, Yair Weiss, William T. Freeman, Bethe free energy, Kikuchi approximations, and belief propagation algorithms ,(2001)
Anders Krogh, Two Methods for Improving Performance of a HMM and their Application for Gene Finding intelligent systems in molecular biology. ,vol. 5, pp. 179- 186 ,(1997)
Kenneth Lange, Kenneth Lange, Reconstruction of Evolutionary Trees Mathematical and Statistical Methods for Genetic Analysis. pp. 164- 182 ,(1997) , 10.1007/978-1-4757-2739-5_10
Nick Goldman, Ziheng Yang, A codon-based model of nucleotide substitution for protein-coding DNA sequences. Molecular Biology and Evolution. ,vol. 11, pp. 725- 736 ,(1994) , 10.1093/OXFORDJOURNALS.MOLBEV.A040153
A Rzhetsky, Estimating Substitution Rates in Ribosomal RNA Genes Genetics. ,vol. 141, pp. 771- 783 ,(1995) , 10.1093/GENETICS/141.2.771