作者: Kim R. Rasmussen , Jens Stoye , Eugene W. Myers
关键词:
摘要: Fast and exact comparison of large genomic sequences remains a challenging task in biosequence analysis. We consider the problem finding all epsilon-matches between two sequences, i.e., local alignments over given length with an error rate at most epsilon. study this theoretically, giving efficient q-gram filter for solving it. Two applications are also discussed, particular sequence assembly BLAST-like comparison. Our results show that method is 25 times faster than BLAST, while not being heuristic.