CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data.

作者: Jonathan S. Packer , Evan K. Maxwell , Colm O’Dushlaine , Alexander E. Lopez , Frederick E. Dewey

DOI: 10.1093/BIOINFORMATICS/BTV547

关键词:

摘要: Motivation: Several algorithms exist for detecting copy number variants (CNVs) from human exome sequencing read depth, but previous tools have not been well suited large population studies on the order of tens or hundreds thousands exomes. Their limitations include being difficult to integrate into automated variant-calling pipelines and ill-suited common variants. To address these issues, we developed a new algorithm—Copy estimation using Lattice-Aligned Mixture Models (CLAMMS)—which is highly scalable suitable CNVs across whole allele frequency spectrum. Results: In this note, summarize methods intended use-case CLAMMS, compare it briefly describe results validation experiments. We evaluate adherence CNV calls CLAMMS four other Mendelian inheritance patterns pedigree; SNP genotyping arrays set 3164 samples; use TaqMan quantitative polymerase chain reaction validate predicted by at 39 loci (95% rare validate; 19 variant loci, mean precision recall are 99% 94%, respectively). Supplementary Materials (available Github repository), present our in greater detail. Availability implementation: https://github.com/rgcgithub/clamms (implemented C). Contact: moc.noreneger@dier.yerffej Supplementary information: data available Bioinformatics online.

参考文章(7)
Peter D. Stenson, Edward V. Ball, Matthew Mort, Andrew D. Phillips, Katy Shaw, David N. Cooper, The Human Gene Mutation Database (HGMD) and its exploitation in the fields of personalized genomics and molecular evolution. Current protocols in human genetics. ,vol. 39, ,(2012) , 10.1002/0471250953.BI0113S39
Robert E Handsaker, Vanessa Van Doren, Jennifer R Berman, Giulio Genovese, Seva Kashin, Linda M Boettger, Steven A McCarroll, Large multiallelic copy number variations in humans Nature Genetics. ,vol. 47, pp. 296- 303 ,(2015) , 10.1038/NG.3200
Daniel Backenroth, Jason Homsy, Laura R Murillo, Joe Glessner, Edwin Lin, Martina Brueckner, Richard Lifton, Elizabeth Goldmuntz, Wendy K Chung, Yufeng Shen, None, CANOES: detecting rare copy number variants from whole exome sequencing data Nucleic Acids Research. ,vol. 42, ,(2014) , 10.1093/NAR/GKU345
Menachem Fromer, Jennifer L. Moran, Kimberly Chambert, Eric Banks, Sarah E. Bergen, Douglas M. Ruderfer, Robert E. Handsaker, Steven A. McCarroll, Michael C. O’Donovan, Michael J. Owen, George Kirov, Patrick F. Sullivan, Christina M. Hultman, Pamela Sklar, Shaun M. Purcell, Discovery and Statistical Genotyping of Copy-Number Variation from Whole-Exome Sequencing Depth American Journal of Human Genetics. ,vol. 91, pp. 597- 607 ,(2012) , 10.1016/J.AJHG.2012.08.005
Vincent Plagnol, James Curtis, Michael Epstein, Kin Y. Mok, Emma Stebbings, Sofia Grigoriadou, Nicholas W. Wood, Sophie Hambleton, Siobhan O. Burns, Adrian J. Thrasher, Dinakantha Kumararatne, Rainer Doffinger, Sergey Nejentsev, A robust model for read count data in exome sequencing experiments and implications for copy number variant calling Bioinformatics. ,vol. 28, pp. 2747- 2754 ,(2012) , 10.1093/BIOINFORMATICS/BTS526
K. Wang, M. Li, D. Hadley, R. Liu, J. Glessner, S. F.A. Grant, H. Hakonarson, M. Bucan, PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data Genome Research. ,vol. 17, pp. 1665- 1674 ,(2007) , 10.1101/GR.6861907
N. Krumm, P. H. Sudmant, A. Ko, B. J. O'Roak, M. Malig, B. P. Coe, A. R. Quinlan, D. A. Nickerson, E. E. Eichler, , Copy number variation detection and genotyping from exome sequence data Genome Research. ,vol. 22, pp. 1525- 1532 ,(2012) , 10.1101/GR.138115.112