QCKer-FPGA: An FPGA Implementation of Q-gram Counting Filter for DNA Sequence Alignment

Authors: Jan Carlo G. Maghirang, Roger Luis Uy, Kaizen Vinz A. Borja, Joven L. Pernez

DOI: 10.1109/HNICEM48295.2019.9072768

Keywords: Computer hardware, Flexibility (engineering), Metric (mathematics), Gram, Filter (signal processing), Field-programmable gate array, Process (computing), Genomics, Computer science, Reference genome

Abstract: Read mapping is a process in which DNA reads are mapped to a reference genome through filtering and verification using a predefined metric. Filtering is done by quickly eliminating incorrect regions when the read is compared to the genome. Verification, on the other hand, is responsible for verifying these candidate regions, which requires mathematical and theoretical approaches. Due to the large amounts of data produced by Next Generation Sequencing (NGS) platforms, a filter is needed to reduce the various computational challenges introduced in the process. FPGAs are special-purpose processors designed to handle compute-intensive applications, having a highly customizable fabric. In this paper, a q-gram counting filter is implemented that takes advantage of the flexibility and capabilities of FPGAs for parallel applications on the ZedBoard development board. The paper discusses the results with varying sizes of q, number of reads, read lengths, and different sequences. The results show an average of 34.02% fewer clock cycles for length 4 and 53.58% fewer for length 8 compared with an implementation in C.
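
To make the filtering step in the abstract concrete, the following is a minimal software sketch of a q-gram counting filter. It is not the authors' FPGA design or their C baseline. It assumes the classical q-gram lemma threshold, (m - q + 1) - q*e shared q-grams for a read of length m with at most e errors, and it simplifies to fixed-length reference windows, so it only models substitution errors. All function names (`qgrams`, `qgram_threshold`, `candidate_windows`) are hypothetical and chosen for illustration.

```python
from collections import Counter


def qgrams(seq, q):
    """Return all overlapping substrings of length q in seq."""
    return [seq[i:i + q] for i in range(len(seq) - q + 1)]


def qgram_threshold(read_len, q, max_errors):
    """q-gram lemma bound (assumed here, not taken from the paper):
    a window within max_errors substitutions of the read shares at
    least (read_len - q + 1) - q * max_errors q-grams with it."""
    return (read_len - q + 1) - q * max_errors


def candidate_windows(reference, read, q, max_errors):
    """Return start positions of read-length reference windows that
    pass the counting filter and would go on to verification."""
    m = len(read)
    starts = range(len(reference) - m + 1)
    threshold = qgram_threshold(m, q, max_errors)
    if threshold <= 0:
        # The bound is vacuous, so the filter cannot reject any window.
        return list(starts)

    read_counts = Counter(qgrams(read, q))
    hits = []
    for start in starts:
        window_counts = Counter(qgrams(reference[start:start + m], q))
        # Multiset intersection: count each q-gram as often as it
        # occurs in both the read and the window.
        shared = sum(min(count, window_counts[gram])
                     for gram, count in read_counts.items())
        if shared >= threshold:
            hits.append(start)
    return hits


if __name__ == "__main__":
    reference = "TTTTACGTACGGTTTT"
    print(candidate_windows(reference, "ACGTACGG", q=4, max_errors=0))
```

Windows that fail the threshold are discarded without alignment, which is the cheap elimination step the abstract attributes to filtering; the surviving positions are only candidates and still need verification. Recounting every window from scratch keeps the sketch short; a practical version would update the counts incrementally as the window slides.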
