Authors: Yi Huang, Susanna K. P. Lau, Patrick C. Y. Woo, Kwok-yung Yuen
DOI: 10.1093/NAR/GKM754
Keywords: Sequence analysis, Sequence (medicine), Genomics, Database, Coronavirus, GenBank, FASTA format, Annotation, Genetics, Biology, Genome
Abstract: The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, were available in GenBank. The number of coronavirus species with complete genomes available increased from 9 in 2003 to 25 in 2007, of which six, including coronavirus HKU1, bat SARS coronavirus, group 1 bat coronavirus HKU2, and groups 2c and 2d coronaviruses, were sequenced by our laboratory. To overcome the problems we encountered in the existing databases during comparative sequence analysis, we built a comprehensive database, CoVDB (http://covdb.microbiology.hku.hk), of annotated coronavirus genes and genomes. CoVDB provides a convenient platform for rapid and accurate batch sequence retrieval, the cornerstone and bottleneck for comparative gene or genome analysis. Sequences can be directly downloaded from the website in FASTA format. CoVDB also provides detailed annotation of all coronavirus sequences using a standardized nomenclature system, and overcomes the problems of duplicated and identical sequences in other databases. For complete genomes, a single representative sequence for each species is available for comparative analysis such as phylogenetic studies. With the annotated sequences in CoVDB, more specific BLAST search results can be generated for efficient downstream analysis.