作者: K.T Pedretti , T.L Casavant , T.E Scheetz , C.L Birkett , C.A Roberts
DOI: 10.1016/S0167-739X(00)00057-1
关键词: DNA sequencing 、 Distributed computing 、 Server 、 Set (abstract data type) 、 Human genome 、 Functional genomics 、 Service (systems architecture) 、 Mode (computer interface) 、 Sequence 、 Parallel computing 、 Gene 、 Distributed database 、 Genome project 、 Computer science
摘要: Abstract This paper describes approaches to improve the performance of one most common and increasingly important aspects Human Genome Project (HGP) — large-volume, batch comparison DNA sequence data. basic operation, usually carried out by well-known BLAST program on subject against internationally available databases nearly five million target sequences, is already used hundreds thousands times each day researchers around world. At present, it still primarily in single query, or small query mode. As entire human genome nears completion, area functional genomics, use micro-arrays sets genes, coming fore. These developments will demand ever more efficient means BLASTing data that make processor implementation powerful workstations infeasible. We describe three primary parallel components BLAST. The first at sequence-to-sequence level. second parallelizes a across partitioned distributed database. Finally, set queries themselves are servers with replicated databases. methods may be employed alone concert. Our current described which requests, our plans for other levels also described. results ultimately applied hardware assistance this soon-to-be primitive computer operation.