The 1000 Genomes Project: data management and community access

作者: Laura Clarke , Xiangqun Zheng-Bradley , Richard Smith , Eugene Kulesha , Chunlin Xiao

DOI: 10.1038/NMETH.1974

关键词:

摘要: The 1000 Genomes Project was launched as one of the largest distributed data collection and analysis projects ever undertaken in biology. In addition to primary scientific goals creating both a deep catalog human genetic variation extensive methods accurately discover characterize using new sequencing technologies, project makes all its publicly available. Members coordination center have developed deployed several tools enable widespread access.

参考文章(15)
Nicole L. Washington, E. O. Stinson, Marc D. Perry, Peter Ruzanov, Sergio Contrino, Richard Smith, Zheng Zha, Rachel Lyne, Adrian Carr, Paul Lloyd, Ellen Kephart, Sheldon J. McKay, Gos Micklem, Lincoln D. Stein, Suzanna E. Lewis, The modENCODE Data Coordination Center: lessons in harvesting comprehensive experimental details Database. ,vol. 2011, ,(2011) , 10.1093/DATABASE/BAR023
H. Li, B. Handsaker, A. Wysoker, T. Fennell, J. Ruan, N. Homer, G. Marth, G. Abecasis, R. Durbin, , The Sequence Alignment/Map format and SAMtools Bioinformatics. ,vol. 25, pp. 2078- 2079 ,(2009) , 10.1093/BIOINFORMATICS/BTP352
G. A. Thorisson, The International HapMap Project Web site web science. ,vol. 15, pp. 1592- 1593 ,(2005) , 10.1101/GR.4413105
Martin Shumway, Guy Cochrane, Hideaki Sugawara, Archiving next generation sequencing data Nucleic Acids Research. ,vol. 38, pp. 870- 871 ,(2010) , 10.1093/NAR/GKP1078
Petr Danecek, Adam Auton, Goncalo Abecasis, Cornelis A Albers, Eric Banks, Mark A DePristo, Robert E Handsaker, Gerton Lunter, Gabor T Marth, Stephen T Sherry, Gilean McVean, Richard Durbin, 1000 Genomes Project Analysis Group, None, The variant call format and VCFtools Bioinformatics. ,vol. 27, pp. 2156- 2158 ,(2011) , 10.1093/BIOINFORMATICS/BTR330
K. R. Rosenbloom, T. R. Dreszer, M. Pheasant, G. P. Barber, L. R. Meyer, A. Pohl, B. J. Raney, T. Wang, A. S. Hinrichs, A. S. Zweig, P. A. Fujita, K. Learned, B. Rhead, K. E. Smith, R. M. Kuhn, D. Karolchik, D. Haussler, W. J. Kent, ENCODE whole-genome data in the UCSC Genome Browser Nucleic Acids Research. ,vol. 38, pp. 620- 625 ,(2010) , 10.1093/NAR/GKP961
Richard M Durbin, Matt E Hurles, Gil A McVean, Richard A Gibbs, Gonçalo R Abecasis, David Altshuler, David Altshuler, Adam Auton, Lisa D Brooks, A Map of Human Genome Variation From Population-Scale Sequencing Nature. ,vol. 467, pp. 1061- 1073 ,(2010) , 10.1038/NATURE09534
, Prepublication data sharing Nature. ,vol. 461, pp. 168- 170 ,(2009) , 10.1038/461168A