Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond.

Authors: Elizabeth M. Blue, Lei Sun, Nathan L. Tintle, Ellen M. Wijsman

DOI: 10.1002/GEPI.21821

Abstract: When analyzing family data, we dream of perfectly informative data, even whole-genome sequences (WGSs) for all family members. Reality intervenes, and we find that next-generation sequencing (NGS) data have errors and are often too expensive or impossible to collect on everyone. The Genetic Analysis Workshop 18 working groups on quality control and dropping WGSs through families using a genome-wide association framework focused on finding, correcting, and using errors within the available sequence and family data, developing methods to infer and analyze missing sequence data among relatives, and testing for linkage and association with simulated blood pressure. We found that single-nucleotide polymorphisms, NGS data, and imputed data are generally concordant but that errors are particularly likely at rare variants, for homozygous genotypes, within regions with repeated sequences or structural variants, and within sequence data imputed from unrelated individuals. Admixture complicated identification of cryptic relatedness, but information from Mendelian transmission improved error detection and provided an estimate of the de novo mutation rate. Computationally, fast rule-based imputation was accurate but could not cover as many loci or subjects as more computationally demanding probability-based methods. Incorporating population-level data into pedigree-based imputation methods improved results. Observed data outperformed imputed data in association testing, but imputed data were also useful. We discuss the strengths and weaknesses of existing methods and suggest possible future directions, such as improving communication between data collectors and data analysts, establishing thresholds for and improving imputation quality, and incorporating error into imputation and analytical models.
