Predictions of Native American Population Structure Using Linguistic Covariates in a Hidden Regression Framework

作者: Flora Jay , Olivier François , Michael G. B. Blum

DOI: 10.1371/JOURNAL.PONE.0016227

关键词:

摘要: BACKGROUND: The mainland of the Americas is home to a remarkable diversity languages, and relationships between genes languages have attracted considerable attention in past. Here we investigate which extent geography can predict genetic structure Native American populations. METHODOLOGY/PRINCIPAL FINDINGS: Our approach based on Bayesian latent cluster regression model membership explained by geographic linguistic covariates. After correcting for effects, find that inclusion information improves prediction individual clusters. We further compare predictive power Greenberg's Ethnologue classifications Amerindian languages. report classification provides better proxy than at stock group levels. Although high values be achieved from classification, nevertheless emphasize Choco, Chibchan Tupi families do not exhibit univocal correspondence with CONCLUSIONS/SIGNIFICANCE: class described here efficient predicting population using

参考文章(69)
Francisco Silva Noelli, The Tupi Expansion Springer, New York, NY. pp. 659- 670 ,(2008) , 10.1007/978-0-387-74907-5_33
P E Smouse, H Gershowitz, J Azofeifa, J V Neel, R Barrantes, H W Mohrenweiser, T D Arias, Microevolution in lower Central America: genetic characterization of the Chibcha-speaking groups of Costa Rica and Panama, and a consensus taxonomy based on genetic and linguistic affinity. American Journal of Human Genetics. ,vol. 46, pp. 63- 84 ,(1990)
L. Campbell, Long-Range Comparison: Methodological Disputes Encyclopedia of Language & Linguistics (Second Edition). pp. 324- 331 ,(2006) , 10.1016/B0-08-044854-2/01898-8
Paul M. Lewis, Ethnologue : languages of the world SIL International. ,(2009)
Detmar Meurers, Robert Levine, Encyclopedia of Language and Linguistics ,(2006)
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Merritt Ruhlen, A guide to the world's languages ,(1987)
Padhraic Smyth, Model selection for probabilistic clustering using cross-validatedlikelihood Statistics and Computing. ,vol. 10, pp. 63- 72 ,(2000) , 10.1023/A:1008940618127
Jukka Corander, Mikko J Sillanpää, Patrik Waldmann, Bayesian analysis of genetic differentiation between populations. Genetics. ,vol. 163, pp. 367- 374 ,(2003) , 10.1093/GENETICS/163.1.367