IMPUTATION-BASED ASSESSMENT OF NEXT GENERATION RARE EXOME VARIANT ARRAYS

作者: ALICIA R. MARTIN , GERARD TSE , CARLOS D. BUSTAMANTE , EIMEAR E. KENNY

DOI: 10.1142/9789814583220_0024

关键词:

摘要: A striking finding from recent large-scale sequencing efforts is that the vast majority of variants in human genome are rare and found within single populations or lineages. These observations hold important implications for design next round disease variant discovery efforts—if genetic influence risk follow same trend, then we expect to see population-specific associations require large samples sizes detection. To address this challenge, due still prohibitive cost cohorts, researchers have developed a new generation low-cost genotyping arrays assay variation previously identified exome studies. Genotyping approaches rely not only on directly observing variants, but also phasing imputation methods use publicly available reference panels infer unobserved study cohort. Rare intentionally enriched likely be causing, here ability first commercially array (the Illumina Infinium HumanExome BeadChip) tag other potentially damaging molecularly assayed. Using full sequence data chromosome 22 phase I 1000 Genomes Project, evaluate three (BEAGLE, MaCH-Admix, SHAPEIT2/IMPUTE2) with under varied panel sizes, LD structures via population differences. We find more accurate across both common than all allele frequencies, including alleles. least African populations, accuracy substantially improved when included panel. Depending goals GWAS researchers, our results will aid budget decisions by helping determine whether money best spent genomes smaller sample larger and/or imputing SNPs, some combination two.

参考文章(24)
Kimmo Palin, Harry Campbell, Alan F. Wright, James F. Wilson, Richard Durbin, Identity-by-descent-based phasing and imputation in founder populations using graphical models. Genetic Epidemiology. ,vol. 35, pp. 853- 860 ,(2011) , 10.1002/GEPI.20635
Bryan Howie, Jonathan Marchini, Matthew Stephens, Genotype Imputation with Thousands of Genomes G3: Genes, Genomes, Genetics. ,vol. 1, pp. 457- 470 ,(2011) , 10.1534/G3.111.001198
Brian L. Browning, Sharon R. Browning, A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals American Journal of Human Genetics. ,vol. 84, pp. 210- 223 ,(2009) , 10.1016/J.AJHG.2009.01.005
Olivier Delaneau, Jonathan Marchini, Jean-François Zagury, A linear complexity phasing method for thousands of genomes Nature Methods. ,vol. 9, pp. 179- 181 ,(2012) , 10.1038/NMETH.1785
Alkes L. Price, Arti Tandon, Nick Patterson, Kathleen C. Barnes, Nicholas Rafaels, Ingo Ruczinski, Terri H. Beaty, Rasika Mathias, David Reich, Simon Myers, Sensitive Detection of Chromosomal Segments of Distinct Ancestry in Admixed Populations PLoS Genetics. ,vol. 5, pp. e1000519- ,(2009) , 10.1371/JOURNAL.PGEN.1000519
Lucy Huang, Mattias Jakobsson, Trevor J. Pemberton, Muntaser Ibrahim, Thomas Nyambo, Sabah Omar, Jonathan K. Pritchard, Sarah A. Tishkoff, Noah A. Rosenberg, Haplotype variation and genotype imputation in African populations. Genetic Epidemiology. ,vol. 35, pp. 766- 780 ,(2011) , 10.1002/GEPI.20626
Amy L. Williams, Nick Patterson, Joseph Glessner, Hakon Hakonarson, David Reich, Phasing of Many Thousands of Genotyped Samples American Journal of Human Genetics. ,vol. 91, pp. 238- 251 ,(2012) , 10.1016/J.AJHG.2012.06.013
Jonathan Marchini, Bryan Howie, Genotype imputation for genome-wide association studies Nature Reviews Genetics. ,vol. 11, pp. 499- 511 ,(2010) , 10.1038/NRG2796
Elizabeth T. Cirulli, David B. Goldstein, Uncovering the roles of rare variants in common disease through whole-genome sequencing Nature Reviews Genetics. ,vol. 11, pp. 415- 425 ,(2010) , 10.1038/NRG2779