作者: Erik L. L. Sonnhammer , Richard Durbin
DOI:
关键词:
摘要: When confronted with the task of finding homology to large numbers sequences, database searching tools such as Blast and Fasta generate prohibitively amounts information. An automatic way making most decisions a trained sequence analyst would make was developed by means rule-based expert system combined an algorithm avoid non-informative biased residue composition matches. The results found relevant are presented in very concise clear way, so that can be assessed minimum effort. system, HSPcrunch, implemented process output programs BLAST suite. HSPcrunch embodies rules on detecting distant similarities when pairs weak matches consistent larger gapped alignment, i.e. has broken longer alignment up into smaller ungapped ones. This more detected no or little side-effects spurious for how small gaps must considered significant have been derived empirically. Currently set used operate two different scoring levels, one medium slightly gaps. proved robust cases gives high fidelity separation between real homologies One important reducing amount is limit number overlapping same region query sequence.(ABSTRACT TRUNCATED AT 250 WORDS)