An acceleration method of short read mapping using FPGA

作者: Yoko Sogabe , Tsutomu Maruyama

DOI: 10.1109/FPT.2013.6718385

关键词:

摘要: The rapid development of Next Generation Sequencing (NGS) has enabled to generate more than 100G base pairs per day from one machine. produced data are randomly fragmented DNA pair strings, called short reads, and millions reads mapped onto the reference genomes, which complete genetic sequences, reconstruct sequence sample DNA. This read mapping is becoming bottle-neck NGS systems. In this paper, we propose an FPGA system for based on a hash-index method. our system, divided into seeds, fixed-length substrings used mapping, seeds sorted using buckets. Then, in each bucket compared parallel with candidate locations. With approach, many can be massively manner their locations, it becomes possible improve processing speed by reducing number random accesses DRAM banks store Furthermore, substitutions nucleotides seed allowed comparison. makes achieve higher matching rates previous works.

参考文章(6)
Nils Homer, Barry Merriman, Stanley F. Nelson, BFAST: An Alignment Tool for Large Scale Genome Resequencing PLoS ONE. ,vol. 4, pp. e7767- 12 ,(2009) , 10.1371/JOURNAL.PONE.0007767
T.F. Smith, M.S. Waterman, Identification of common molecular subsequences. Journal of Molecular Biology. ,vol. 147, pp. 195- 197 ,(1981) , 10.1016/0022-2836(81)90087-5
H. Li, R. Durbin, Fast and accurate short read alignment with Burrows–Wheeler transform Bioinformatics. ,vol. 25, pp. 1754- 1760 ,(2009) , 10.1093/BIOINFORMATICS/BTP324
Ben Langmead, Cole Trapnell, Mihai Pop, Steven L Salzberg, Ultrafast and memory-efficient alignment of short DNA sequences to the human genome Genome Biology. ,vol. 10, pp. 1- 10 ,(2009) , 10.1186/GB-2009-10-3-R25
Corey B. Olson, Maria Kim, Cooper Clauson, Boris Kogon, Carl Ebeling, Scott Hauck, Walter L. Ruzzo, Hardware Acceleration of Short Read Mapping 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines. pp. 161- 168 ,(2012) , 10.1109/FCCM.2012.36
D. J. Wheeler, M. Burrows, A Block-sorting Lossless Data Compression Algorithm ,(1994)