Birth and death of protein domains: a simple model of evolution explains power law behavior.

作者: Georgy P Karev , Yuri I Wolf , Andrey Y Rzhetsky , Faina S Berezovskaya , Eugene V Koonin

DOI: 10.1186/1471-2148-2-18

关键词:

摘要: Power distributions appear in numerous biological, physical and other contexts, which to be fundamentally different. In biology, power laws have been claimed describe the of connections enzymes metabolites metabolic networks, number interactions partners a given protein, members paralogous families, quantities. network analysis, imply evolution with preferential attachment, i.e. greater likelihood nodes being added pre-existing hubs. Exploration different types evolutionary models an attempt determine them lead law has potential revealing non-trivial aspects genome evolution. A simple model domain composition proteomes was developed, following elementary processes: i) birth (duplication divergence), ii) death (inactivation and/or deletion), iii) innovation (emergence from non-coding or non-globular sequences acquisition via horizontal gene transfer). This formalism can described as b irth, d eath i nnovation m odel (BDIM). The formulas for equilibrium frequencies families size total at are derived general BDIM. All asymptotics possible type found their appearance depending on parameters is investigated. It proved that appears if, only balanced, duplication deletion rates asymptotically equal up second order. further any asymptotic degree not -1 if hypothesis independence duplication/deletion family rejected. Specific cases BDIMs, namely simple, linear, polynomial rational models, considered details determined each case. We apply BDIM analysis prokaryotic eukaryotic show excellent fit between these empirical data particular form model, second-order balanced linear Calculation suggests surprisingly high rates, comparable (duplication) elimination particularly genomes. straightforward evolution, does explicitly include selection, sufficient explain observed sizes, asymptotic. However, compatible data, there precise balance birth, this likely maintained by selection. developed approach oriented mathematical description proteomes, but reformulation could applied evolving networks attachment.

参考文章(47)
Ilya Shmulevich, Wei Zhang, Computational and Statistical Approaches to Genomics ,(2007)
Stephen Jay Gould, The Structure of Evolutionary Theory ,(2002)
S. N. Dorogovtsev, J. F. F. Mendes, Scaling properties of scale-free evolving networks: continuous approach. Physical Review E. ,vol. 63, pp. 056125- ,(2001) , 10.1103/PHYSREVE.63.056125
Eugene V Koonin, L Aravind, Alexey S Kondrashov, The Impact of Comparative Genomics on Our Understanding of Evolution Cell. ,vol. 101, pp. 573- 576 ,(2000) , 10.1016/S0092-8674(00)80867-3
Steven E. Brenner, Tim Hubbard, Alexey Murzin, Cyrus Chothia, Gene duplications in H. influenzae. Nature. ,vol. 378, pp. 140- 140 ,(1995) , 10.1038/378140A0
P. L. Krapivsky, S. Redner, Organization of growing random networks Physical Review E. ,vol. 63, pp. 066123- ,(2001) , 10.1103/PHYSREVE.63.066123
Yuri I Wolf, Nick V Grishin, Eugene V Koonin, Estimating the number of protein folds and families from complete genome data Journal of Molecular Biology. ,vol. 299, pp. 897- 905 ,(2000) , 10.1006/JMBI.2000.3786
Iosif Il"ich Gihman, Anatolii Vladimirovich Skorohod, The theory of stochastic processes ,(1974)
Albert-László Barabási, Réka Albert, Emergence of Scaling in Random Networks Science. ,vol. 286, pp. 509- 512 ,(1999) , 10.1126/SCIENCE.286.5439.509