CluStrat: A Structure Informed Clustering Strategy for Population Stratification

作者: Aritra Bose , Myson C. Burch , Agniva Chowdhury , Peristera Paschou , Petros Drineas

DOI: 10.1007/978-3-030-45257-5_19

关键词:

摘要: Genome-wide association studies (GWAS) have been extensively used to estimate the signed effects of trait-associated alleles. One key challenges in GWAS are confounding factors, such as population stratification, which can lead spurious genotype-trait associations. Recent independent [1, 8, 10] failed replicate strong evidence previously reported signals directional selection on height Europeans UK Biobank cohort, and attributed loss signal cryptic relatedness populations.

参考文章(11)
P. C. Mahalanobis, On the generalized distance in statistics Proceedings of the National Institute of Sciences (Calcutta). ,vol. 2, pp. 49- 55 ,(1936)
W J Ewens, R S Spielman, The transmission/disequilibrium test: history, subdivision, and admixture. American Journal of Human Genetics. ,vol. 57, pp. 455- 464 ,(1995)
Xiang Zhou, Matthew Stephens, Genome-wide efficient mixed-model analysis for association studies Nature Genetics. ,vol. 44, pp. 821- 824 ,(2012) , 10.1038/NG.2310
Hyun Min Kang, Jae Hoon Sul, Susan K Service, Noah A Zaitlen, Sit-yee Kong, Nelson B Freimer, Chiara Sabatti, Eleazar Eskin, Variance component model to account for sample structure in genome-wide association studies Nature Genetics. ,vol. 42, pp. 348- 354 ,(2010) , 10.1038/NG.548
Alkes L Price, Nick J Patterson, Robert M Plenge, Michael E Weinblatt, Nancy A Shadick, David Reich, Principal components analysis corrects for stratification in genome-wide association studies Nature Genetics. ,vol. 38, pp. 904- 909 ,(2006) , 10.1038/NG1847
Mashaal Sohail, Robert M Maier, Andrea Ganna, Alex Bloemendal, Alicia R Martin, Michael C Turchin, Charleston WK Chiang, Joel Hirschhorn, Mark J Daly, Nick Patterson, Benjamin Neale, Iain Mathieson, David Reich, Shamil R Sunyaev, Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies eLife. ,vol. 8, pp. 39702- ,(2019) , 10.7554/ELIFE.39702
Aritra Bose, Vassilis Kalantzis, Eugenia-Maria Kontopoulou, Mai Elkady, Peristera Paschou, Petros Drineas, TeraPCA: a fast and scalable software package to study genetic variation in tera-scale genotypes. Bioinformatics. ,vol. 35, pp. 3679- 3683 ,(2019) , 10.1093/BIOINFORMATICS/BTZ157
Jeremy J Berg, Arbel Harpak, Nasa Sinnott-Armstrong, Anja Moltke Joergensen, Hakhamanesh Mostafavi, Yair Field, Evan August Boyle, Xinjun Zhang, Fernando Racimo, Jonathan K Pritchard, Graham Coop, Reduced signal for polygenic adaptation of height in UK Biobank eLife. ,vol. 8, ,(2019) , 10.7554/ELIFE.39725
Minsun Song, Wei Hao, John D Storey, Testing for genetic associations in arbitrarily structured populations. Nature Genetics. ,vol. 47, pp. 550- 554 ,(2015) , 10.1038/NG.3244