Vecuum: identification and filtration of false somatic variants caused by recombinant vector contamination

Authors: Junho Kim, Ju Heon Maeng, Jae Seok Lim, Hyeonju Son, Junehawk Lee

DOI: 10.1093/BIOINFORMATICS/BTW383

Abstract: Motivation: Advances in sequencing technologies have remarkably lowered the detection limit of somatic variants to a low frequency. However, calling mutations at this range is still confounded by many factors, including environmental contamination. Vector contamination is a continuously occurring issue and is especially problematic since vector inserts are hardly distinguishable from the sample sequences. Such inserts, which may harbor polymorphisms and engineered functional mutations, can result in false variant calls at the corresponding sites. Numerous vector-screening methods have been developed, but none could handle contamination from inserts because they focus on vector backbone sequences alone. Results: We developed a novel method, Vecuum, that identifies vector-originated reads and the resultant false variants. Since vector inserts are generally constructed from intron-less cDNAs, Vecuum identifies vector-originated reads by inspecting the clipping patterns at exon junctions. False variant calls are further detected based on the biased distribution of mutant alleles toward vector-originated reads. Tests on simulated and spike-in experimental data validated that Vecuum could detect 93% of vector contaminants and could remove up to 87% of variant-like false calls with 100% precision. Application to public sequence datasets demonstrated the utility of Vecuum in detecting false variants resulting from various types of external contamination. Availability and implementation: A Java-based implementation of the method is available at http://vecuum.sourceforge.net/ Contact: swkim@yuhs.ac Supplementary information: Supplementary data are available at Bioinformatics online.
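The abstract gives no implementation details, so the following is only an illustrative sketch of the clipping-pattern idea, not Vecuum's actual code. It assumes reads are described by a SAM-style 1-based position and CIGAR string, and it flags a read when a soft clip begins or ends at an annotated exon edge, which is the pattern expected when an intron-less cDNA read is aligned to the genome. The function names `clip_boundaries` and `is_junction_clipped`, the `exons` list, and the `tolerance` parameter are hypothetical.

```python
import re

# Matches each CIGAR operation, e.g. "31M19S" -> [("31", "M"), ("19", "S")]
CIGAR_RE = re.compile(r"(\d+)([MIDNSHP=X])")


def clip_boundaries(pos, cigar):
    """Return (side, ref_pos) for each soft clip in the alignment.

    pos is the 1-based leftmost aligned reference position (SAM convention).
    "left" means the clip precedes the first aligned base at ref_pos;
    "right" means the clip follows the last aligned base at ref_pos.
    """
    ops = [(int(n), op) for n, op in CIGAR_RE.findall(cigar)]
    ops = [(n, op) for n, op in ops if op != "H"]  # ignore hard clips
    boundaries = []
    if ops and ops[0][1] == "S":
        boundaries.append(("left", pos))
    # Operations that consume reference bases
    ref_len = sum(n for n, op in ops if op in "MDN=X")
    if len(ops) > 1 and ops[-1][1] == "S":
        boundaries.append(("right", pos + ref_len - 1))
    return boundaries


def is_junction_clipped(pos, cigar, exons, tolerance=2):
    """True if a soft clip coincides with an exon edge.

    exons is a list of (start, end) pairs, 1-based inclusive (assumed
    annotation format). A left clip is compared with exon starts and a
    right clip with exon ends, within `tolerance` bases.
    """
    for side, ref_pos in clip_boundaries(pos, cigar):
        for start, end in exons:
            edge = start if side == "left" else end
            if abs(ref_pos - edge) <= tolerance:
                return True
    return False


if __name__ == "__main__":
    exons = [(100, 200), (500, 600)]  # toy annotation
    print(is_junction_clipped(170, "31M19S", exons))  # clip right at exon end
    print(is_junction_clipped(120, "50M", exons))     # no clipping at all
```

A real pipeline would also have to handle strand, paired reads, and the allele-bias step described in the abstract; none of that is modeled here.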
