Base compositional structure of genomes

作者: James W. Fickett , David C. Torney , David R. Wolf

DOI: 10.1016/0888-7543(92)90019-O

关键词:

摘要: Abstract We model the base compositional structure of human and Escherichia coli genomes. Three particular properties are first quantified: (1) There is a significant tendency for any region either genome to have strand-symmetric composition. (2) The variation in composition from region, within each genome, very much larger than expected common homogeneous stochastic models. (3) A given local tends persist over scale at least kilobases ( E. ) or tens (human). Multidomain models literature reviewed sharpened. In particular, quantitative measurements third property lead us suggest shift style domain models, which + T content with position modeled by random walk frequent small steps rather large quantum jumps. As an application, we way reduce amount computation assembly sequences randomly chosen fragments.

参考文章(19)
Jan Filipski, Jean-Paul Thiery, Giorgio Bernardi, An analysis of the bovine genome by Cs2SO4-Ag density gradient centrifugation. Journal of Molecular Biology. ,vol. 80, pp. 177- 197 ,(1973) , 10.1016/0022-2836(73)90240-4
G CHURCHILL, Stochastic models for heterogeneous DNA sequences Bulletin of Mathematical Biology. ,vol. 51, pp. 79- 94 ,(1989) , 10.1016/S0092-8240(89)80049-7
Genshiro Kitagawa, Non-Gaussian State—Space Modeling of Nonstationary Time Series Journal of the American Statistical Association. ,vol. 82, pp. 1032- 1041 ,(1987) , 10.1080/01621459.1987.10478534
G. Ott, Compact encoding of stationary Markov sources IEEE Transactions on Information Theory. ,vol. 13, pp. 82- 86 ,(1967) , 10.1109/TIT.1967.1053960
Laura Manuelidis, David C. Ward, Chromosomal and nuclear distribution of the HindIII 1.9-kb human DNA repeat segment Chromosoma. ,vol. 91, pp. 28- 38 ,(1984) , 10.1007/BF00286482
R.A. Elton, Theoretical models for heterogeneity of base composition in DNA Journal of Theoretical Biology. ,vol. 45, pp. 533- 553 ,(1974) , 10.1016/0022-5193(74)90129-5
Richard W. Katz, On Some Criteria for Estimating the Order of a Markov Chain Technometrics. ,vol. 23, pp. 243- ,(1981) , 10.1080/00401706.1981.10486293
Christian Burks, Doyne Farmer, Towards modeling DNA sequences as automata Physica D: Nonlinear Phenomena. ,vol. 10, pp. 157- 167 ,(1984) , 10.1016/0167-2789(84)90258-6