A variable length hash method for faster short read mapping on FPGA

作者: Yoko Sogabe , Tsutomu Maruyama

DOI: 10.1109/FPL.2015.7293938

关键词:

摘要: Short read mapping is a process to align the short reads, which are fixed-length fragments of target genome, given reference genome identify mutations in genome. Because rapid development Next Generation Sequencing (NGS) technologies, faster required. In this paper, we propose variable length hash method further accelerate FPGA systems. hash-based algorithms, fixed sub-string each read, called seed, used as key. However, many different seeds mapped into same slots because high ununiformity human and fruitless key comparisons performed. To equalize slot size, an optimized function that changes bit masks adaptively. With approach, it possible improve performance all systems based on functions. The for comparison our system Xilinx XC7VX690T XC6VLX240T can be improved two-times, total outperforms any existing

参考文章(13)
J. Arram, K. H. Tsoi, Wayne Luk, P. Jiang, Hardware acceleration of genetic sequence alignment applied reconfigurable computing. pp. 13- 24 ,(2013) , 10.1007/978-3-642-36812-7_2
Youkou Sogabe, Tsutomu Maruyama, FPGA acceleration of short read mapping based on sort and parallel comparison 2014 24th International Conference on Field Programmable Logic and Applications (FPL). pp. 1- 4 ,(2014) , 10.1109/FPL.2014.6927404
Nils Homer, Barry Merriman, Stanley F. Nelson, BFAST: An Alignment Tool for Large Scale Genome Resequencing PLoS ONE. ,vol. 4, pp. e7767- 12 ,(2009) , 10.1371/JOURNAL.PONE.0007767
Yoko Sogabe, Tsutomu Maruyama, An acceleration method of short read mapping using FPGA field-programmable technology. pp. 350- 353 ,(2013) , 10.1109/FPT.2013.6718385
H. Li, R. Durbin, Fast and accurate short read alignment with Burrows–Wheeler transform Bioinformatics. ,vol. 25, pp. 1754- 1760 ,(2009) , 10.1093/BIOINFORMATICS/BTP324
Thomas B. Preuber, Oliver Knodel, Rainer G. Spallek, Short-Read Mapping by a Systolic Custom FPGA Computation 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines. pp. 169- 176 ,(2012) , 10.1109/FCCM.2012.37
Ben Langmead, Cole Trapnell, Mihai Pop, Steven L Salzberg, Ultrafast and memory-efficient alignment of short DNA sequences to the human genome Genome Biology. ,vol. 10, pp. 1- 10 ,(2009) , 10.1186/GB-2009-10-3-R25
Edward Fernandez, Walid Najjar, Stefano Lonardi, String Matching in Hardware Using the FM-Index 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines. pp. 218- 225 ,(2011) , 10.1109/FCCM.2011.55
Edward Fernandez, Walid Najjar, Elena Harris, Stefano Lonardi, Exploration of Short Reads Genome Mapping in Hardware field-programmable logic and applications. pp. 360- 363 ,(2010) , 10.1109/FPL.2010.78
Yupeng Chen, Bertil Schmidt, Douglas L Maskell, A hybrid short read mapping accelerator. BMC Bioinformatics. ,vol. 14, pp. 67- 67 ,(2013) , 10.1186/1471-2105-14-67