Parallelization of local BLAST service on workstation clusters

Authors: K.T. Pedretti, T.L. Casavant, T.E. Scheetz, C.L. Birkett, C.A. Roberts

Keywords: DNA sequencing, Distributed computing, Server, Set (abstract data type), Human genome, Functional genomics, Service (systems architecture), Mode (computer interface), Sequence, Parallel computing, Gene, Distributed database, Genome project, Computer science

Abstract: This paper describes approaches to improve the performance of one of the most common and increasingly important aspects of the Human Genome Project (HGP): large-volume, batch comparison of DNA sequence data. The basic operation, usually carried out by the well-known BLAST program on a subject sequence against internationally available databases of nearly five million target sequences, is already used hundreds of thousands of times each day by researchers around the world. At present, it is still used primarily in single-query or small-query-set mode. As the entire human genome nears completion, the area of functional genomics, and the use of micro-arrays of sets of genes, is coming to the fore. These developments will demand ever more efficient means of BLASTing sets of data, making single-processor implementation on powerful workstations infeasible. We describe three primary parallel components of BLAST. The first is at the sequence-to-sequence level. The second parallelizes a query across a partitioned and distributed database. Finally, the set of queries themselves is partitioned across a set of servers with replicated databases. The methods may be employed alone or in concert. We describe our current implementation, which parallelizes the requests, along with our plans for the other levels. The results will ultimately be applied to hardware assistance for this soon-to-be primitive computer operation.
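The third level described in the abstract, partitioning a set of queries across servers that each hold a replica of the database, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the round-robin policy, the names `partition_queries`, `run_query_set`, and `toy_search`, and the use of a thread pool to stand in for remote servers are all assumptions made for illustration, and `toy_search` is a placeholder that does not run BLAST.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Sequence, Tuple

Query = Tuple[str, str]  # (query id, nucleotide sequence)


def partition_queries(queries: Sequence[Query], n_servers: int) -> List[List[Query]]:
    """Deal queries round-robin into n_servers batches (assumed policy)."""
    if n_servers < 1:
        raise ValueError("n_servers must be at least 1")
    batches: List[List[Query]] = [[] for _ in range(n_servers)]
    for i, query in enumerate(queries):
        batches[i % n_servers].append(query)
    return batches


def run_query_set(
    queries: Sequence[Query],
    n_servers: int,
    search_batch: Callable[[List[Query]], Dict[str, int]],
) -> Dict[str, int]:
    """Send one batch to each (simulated) server and merge the results."""
    batches = partition_queries(queries, n_servers)
    merged: Dict[str, int] = {}
    # Each worker thread stands in for a server with its own database replica.
    with ThreadPoolExecutor(max_workers=n_servers) as pool:
        for partial in pool.map(search_batch, batches):
            merged.update(partial)
    return merged


def toy_search(batch: List[Query]) -> Dict[str, int]:
    """Hypothetical stand-in for a search: reports each sequence's length."""
    return {query_id: len(sequence) for query_id, sequence in batch}


if __name__ == "__main__":
    demo = [("q1", "ACGT"), ("q2", "AC"), ("q3", "A"), ("q4", "ACGTA")]
    print(run_query_set(demo, 2, toy_search))
```

Because every server holds the full database in this scheme, the batches are independent and the merge step is a plain union of per-query results. Partitioning the database itself (the second level) would instead require combining hits for the same query from several partitions.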


