Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond.

作者： Elizabeth M. Blue , Lei Sun , Nathan L. Tintle , Ellen M. Wijsman

关键词:

摘要: When analyzing family data, we dream of perfectly informative even whole-genome sequences (WGSs) for all members. Reality intervenes, and find that next-generation sequencing (NGS) data have errors are often too expensive or impossible to collect on everyone. The Genetic Analysis Workshop 18 working groups quality control dropping WGSs through families using a genome-wide association framework focused finding, correcting, within the available sequence developing methods infer analyze missing among relatives, testing linkage with simulated blood pressure. We found single-nucleotide polymorphisms, NGS imputed generally concordant but particularly likely at rare variants, homozygous genotypes, regions repeated structural from unrelated individuals. Admixture complicated identification cryptic relatedness, information Mendelian transmission improved error detection provided an estimate de novo mutation rate. Computationally, fast rule-based imputation was accurate could not cover as many loci subjects more computationally demanding probability-based methods. Incorporating population-level into pedigree-based results. Observed outperformed in testing, were also useful. discuss strengths weaknesses existing suggest possible future directions, such improving communication between collectors analysts, establishing thresholds quality, incorporating analytical models.

参考文章(55)

R. W. Cottingham, M. Kimmel, M. G. Ehm, Error detection for genetic data, using likelihood methods. American Journal of Human Genetics. ,vol. 58, pp. 225- 234 ,(1996)

Buetow Kh, Influence of aberrant observations on high-resolution linkage analysis outcomes. American Journal of Human Genetics. ,vol. 49, pp. 985- 994 ,(1991)

James L. Weber, Karl W. Broman, 7 Genotyping for human whole-genome scans: Past, present, and future Advances in Genetics. ,vol. 42, pp. 77- 96 ,(2001) , 10.1016/S0065-2660(01)42016-5

Sarah B. Ng, Emily H. Turner, Peggy D. Robertson, Steven D. Flygare, Abigail W. Bigham, Choli Lee, Tristan Shaffer, Michelle Wong, Arindam Bhattacharjee, Evan E. Eichler, Michael Bamshad, Deborah A. Nickerson, Jay Shendure, Targeted capture and massively parallel sequencing of 12 human exomes Nature. ,vol. 461, pp. 272- 276 ,(2009) , 10.1038/NATURE08250

August N Blackburn, Angela K Dean, Donna M Lehman, Imputation in families using a heuristic phasing approach BMC Proceedings. ,vol. 8, pp. 1- 5 ,(2014) , 10.1186/1753-6561-8-S1-S16

Xin Li, Jing Li, Haplotype reconstruction in large pedigrees with untyped individuals through IBD inference. Journal of Computational Biology. ,vol. 18, pp. 1411- 1421 ,(2011) , 10.1089/CMB.2011.0167

Chris C. A. Spencer, Zhan Su, Peter Donnelly, Jonathan Marchini, Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip PLoS Genetics. ,vol. 5, pp. e1000477- ,(2009) , 10.1371/JOURNAL.PGEN.1000477

Dajun Qian, Lars Beckmann, Minimum-Recombinant Haplotyping in Pedigrees American Journal of Human Genetics. ,vol. 70, pp. 1434- 1445 ,(2002) , 10.1086/340610

Sunah Song, Robert Shields, Xin Li, Jing Li, Joint analysis of sequence data and single-nucleotide polymorphism data using pedigree information for imputation and recombination inference BMC Proceedings. ,vol. 8, pp. 1- 6 ,(2014) , 10.1186/1753-6561-8-S1-S20

10.

Bryan N. Howie, Peter Donnelly, Jonathan Marchini, A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLOS Genetics. ,vol. 5, ,(2009) , 10.1371/JOURNAL.PGEN.1000529

Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond.

来源期刊

我的账户

Value of Mendelian laws of segregation in families: data quality control, imputation, and beyond.

来源期刊

相似文章 3

Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

Mendelian inheritance errors in whole genome sequenced trios are enriched in repeats and cluster within copy number losses

Mendelian Inconsistent Signatures from 1314 Ancestrally Diverse Family Trios Distinguish Biological Variation from Sequencing Error.

我的账户