Predicting Mendelian Disease-Causing Non-Synonymous Single Nucleotide Variants in Exome Sequencing Studies

作者: Miao-Xin Li , Johnny S. H. Kwan , Su-Ying Bao , Wanling Yang , Shu-Leong Ho

DOI: 10.1371/JOURNAL.PGEN.1003143

关键词:

摘要: Exome sequencing is becoming a standard tool for mapping Mendelian disease-causing (or pathogenic) non-synonymous single nucleotide variants (nsSNVs). Minor allele frequency (MAF) filtering approach and functional prediction methods are commonly used to identify candidate pathogenic mutations in these studies. Combining multiple may increase accuracy prediction. Here, we propose use logit model combine compute an unbiased probability of rare variant being pathogenic. Also, the first time assess predictive power seven (including SIFT, PolyPhen2, CONDEL, logit) predicting nsSNVs from other variants, which reflects situation after MAF done exome-sequencing We found that combining all or some original outperforms examined, but unable discriminate between autosomal dominant recessive disease mutations. Finally, based on predictions model, estimate individual has around 5% carries ∼22 derived alleles at least, if made homozygous by consanguineous marriages lead diseases.

参考文章(46)
Scott-Emuakpor Ab, The mutation load in an African population. I. An analysis of consanguineous marriages in Nigeria. American Journal of Human Genetics. ,vol. 26, pp. 674- 682 ,(1974)
PC Sham, SM Purcell, SH Kwan, Introduction to biometrical genetics Taylor & Francis Group. ,(2008)
Montgomery Slatkin, Rasmus Nielsen, An Introduction to Population Genetics: Theory and Applications ,(2013)
Saima Riazuddin, Caley M. Castelein, Zubair M. Ahmed, Anil K. Lalwani, Mary A. Mastroianni, Sadaf Naz, Tenesha N. Smith, Nikki A. Liburd, Thomas B. Friedman, Andrew J. Griffith, Sheikh Riazuddin, Edward R. Wilcox, Dominant modifier DFNM1 suppresses recessive deafness DFNB26 Nature Genetics. ,vol. 26, pp. 431- 434 ,(2000) , 10.1038/82558
Herman M. Slatis, A method of estimating the frequency of abnormal autosomal recessive genes in man. American Journal of Human Genetics. ,vol. 6, pp. 412- 418 ,(1954)
Robert E. Hoene, Herman M. Slatis, Raymond H. Reis, Consanguineous marriages in the Chicago region. American Journal of Human Genetics. ,vol. 10, pp. 446- 464 ,(1958)
Sarah B. Ng, Emily H. Turner, Peggy D. Robertson, Steven D. Flygare, Abigail W. Bigham, Choli Lee, Tristan Shaffer, Michelle Wong, Arindam Bhattacharjee, Evan E. Eichler, Michael Bamshad, Deborah A. Nickerson, Jay Shendure, Targeted capture and massively parallel sequencing of 12 human exomes Nature. ,vol. 461, pp. 272- 276 ,(2009) , 10.1038/NATURE08250
Jesse Davis, Mark Goadrich, The relationship between Precision-Recall and ROC curves Proceedings of the 23rd international conference on Machine learning - ICML '06. ,vol. 148, pp. 233- 240 ,(2006) , 10.1145/1143844.1143874
K. Wang, M. Li, H. Hakonarson, ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data Nucleic Acids Research. ,vol. 38, ,(2010) , 10.1093/NAR/GKQ603
Abel González-Pérez, Nuria López-Bigas, Improving the Assessment of the Outcome of Nonsynonymous SNVs with a Consensus Deleteriousness Score, Condel American Journal of Human Genetics. ,vol. 88, pp. 440- 449 ,(2011) , 10.1016/J.AJHG.2011.03.004