Population Genetics of Ancient and Modern DNA

Author: Anna-Sapfo Malaspinas

Abstract: In this work, I develop computational tools that use DNA sequence data to address questions in forensic science, medical genetics, human evolution, and ancient DNA. First, I compute the theoretical probability that two individual profiles match by chance at two loci in a subdivided population. This question is of particular interest in forensic science, where DNA evidence has become a widespread tool of investigation and criminal conviction. I find that ignoring population subdivision can be unfavorable to the defendant, but that the two loci can essentially be treated as unlinked. Second, I develop a method to identify genes that interact, or are in epistasis, to produce a particular phenotype. Identifying interacting genes is of particular relevance in medical genetics, where it helps map disease genes. I validate the method with simulations and demonstrate improved performance over existing approaches. I also apply the method to recently available genomic data from domesticated dogs, identifying genes in epistasis for the hair length phenotype, which are therefore candidate genes for functional validation. Third, I use a summary statistic of DNA sequences, the site frequency spectrum, to estimate parameters of recent human history and to characterize possible admixture between Neanderthals and humans. I find evidence for recent gene flow between Neanderthals and Europeans, and to a lesser extent between Neanderthals and Africans. Finally, I develop a likelihood method to jointly estimate the age and selection coefficient of an identified mutation, along with the population size, using time-serial samples. Such datasets are widespread in ancient DNA research as well as in experimental and viral evolution. I validate the method through simulations. I re-analyze a recent dataset for a locus coding for the distribution of black pigmentation in horses and estimate that the allele far predates domestication, arising between 20,000 and 13,000 years ago.
