A Genealogical Interpretation of Principal Components Analysis

Author: Gil McVean

DOI: 10.1371/JOURNAL.PGEN.1000686

Abstract: Principal components analysis, PCA, is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's F_ST and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference.
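
The central claim of the abstract, that PCA projections follow from pairwise coalescent times, can be illustrated with a minimal sketch. It assumes that the expected sample covariance between genomes i and j is proportional to mean_i + mean_j - grand_mean - t_ij, where t_ij is the expected coalescence time of the pair and the means are taken over rows and over the whole matrix. That double-centered form is an assumption made here for illustration and is not quoted from the abstract. The function names, the two-population layout, and the time values (1.0 within a population, 2.0 between) are hypothetical.

```python
import numpy as np


def expected_pca_from_coalescent_times(t):
    """Return (m, eigvals, coords) from a symmetric matrix of pairwise times.

    Assumption (illustrative): the expected covariance is proportional to
    m[i, j] = mean_i + mean_j - grand_mean - t[i, j].
    """
    t = np.asarray(t, dtype=float)
    row_mean = t.mean(axis=1, keepdims=True)
    col_mean = t.mean(axis=0, keepdims=True)
    grand_mean = t.mean()
    m = row_mean + col_mean - grand_mean - t

    # m is symmetric, so eigh applies; it returns eigenvalues in ascending order.
    eigvals, eigvecs = np.linalg.eigh(m)
    order = np.argsort(eigvals)[::-1]
    eigvals = np.clip(eigvals[order], 0.0, None)  # remove tiny negative round-off
    eigvecs = eigvecs[:, order]

    # Sample coordinates on each axis: eigenvector scaled by sqrt(eigenvalue).
    coords = eigvecs * np.sqrt(eigvals)
    return m, eigvals, coords


def two_population_times(n_per_pop, t_within, t_between):
    """Hypothetical toy model: two populations with fixed pairwise times."""
    labels = np.repeat([0, 1], n_per_pop)
    same_pop = labels[:, None] == labels[None, :]
    t = np.where(same_pop, t_within, t_between).astype(float)
    np.fill_diagonal(t, 0.0)  # a genome coalesces with itself at time zero
    return t, labels


t, labels = two_population_times(n_per_pop=3, t_within=1.0, t_between=2.0)
m, eigvals, coords = expected_pca_from_coalescent_times(t)
pc1 = coords[:, 0]
print("leading eigenvalue:", round(float(eigvals[0]), 3))
print("PC1 coordinates:", np.round(pc1, 3))
```

In this toy setting the first axis places the two populations at opposite ends, with a separation that grows as the between-population time increases relative to the within-population time. The overall sign of the axis is arbitrary, as it is in any eigendecomposition.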
