Submission of Microarray Data to Public Repositories

Authors: Catherine A Ball, Alvis Brazma, Helen Causton, Steve Chervitz, Ron Edgar

DOI: 10.1371/JOURNAL.PBIO.0020317

Abstract: A fundamental principle guiding the publication of scientific results is that the data supporting any scholarly work must be made fully available to the research community, in a form that allows the basic conclusions to be evaluated independently. In the context of molecular biology, this has typically meant that the authors of a paper describing a newly sequenced genome, gene, or protein must deposit the primary data in a permanent, public data repository, such as the sequence databases maintained by the DNA Data Bank of Japan (DDBJ), the European Bioinformatics Institute (EBI), and the National Center for Biotechnology Information (NCBI). Similarly, we, the members of the Microarray Gene Expression Data Society (MGED; http://www.mged.org), believe that all journals should now require the submission of microarray data to public repositories as part of the process of publication. While some journals have already made this a condition of acceptance, we feel that submission requirements should be applied consistently and that journals should recognize ArrayExpress (Brazma et al. 2003), the Gene Expression Omnibus (GEO) (Edgar et al. 2002), and the Center for Information Biology Gene Expression Database (CIBEX) (Ikeo et al. 2003) as acceptable public repositories. To this end, MGED proposes the following new paradigm for the publication of microarray-based studies. (1) Authors should continue to take responsibility for ensuring that the data collected and analyzed in their experiments adhere to the “Minimum Information about a Microarray Experiment” (MIAME) guidelines and should use the MIAME checklist (www.mged.org/Workgroups/MIAME/miame_checklist.html) as a means of achieving this goal. (2) Scientific journals should require that microarray data are submitted to one of the public repositories (ArrayExpress, GEO, or CIBEX) in a format that complies with the MIAME guidelines. (3) Public repositories should establish release protocols that assure compliance with the MIAME guidelines. (4) To assist the review process, public repositories should, in collaboration with publishers, provide qualified referees with a secure means of accessing prepublication data. Authors are strongly encouraged to submit their data during review. Naturally, the data should be protected from general access prior to either publication or authorization from the submitters, whichever comes first. At a minimum, valid accession numbers should be a requirement for publication, and these should be included in the text of the manuscript to allow the community to find and access the underlying data.
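Since its inception in 1999, MGED has been working with the broader community on standards for the exchange and annotation of microarray data. In December 2001, MGED proposed the MIAME guidelines (Brazma et al. 2001) and requested that interested parties provide feedback on their relevance and utility. The feedback from both researchers and journals was overwhelmingly positive, yet almost everyone who responded also asked for help in implementing the guidelines. Subsequently, in the summer of 2002, MGED wrote an open letter to various journals (e.g., Ball et al. 2002a, 2002b) urging them to adopt the MIAME guidelines. We provided the MIAME checklist so that authors, journals, and referees could ensure that sufficient information for the data to be re-analyzed by others would be available. Again, the response was extremely positive, and most major journals now require publications to comply with the MIAME standards. While adoption of the MIAME guidelines has greatly improved the accessibility of microarray data, much of it remains on individual authors' websites in a variety of formats; consequently, obtaining and comparing datasets remains a significant challenge. Clearly, there is a need for an additional requirement: the submission of microarray expression data to public repositories. Though one might ask why this was not part of the original recommendation, the answer is quite simple: MIAME was ahead of its time. While NCBI and EBI had developed nascent repositories, and work was underway to create a similar database at DDBJ, submitting data was a considerable burden for authors. However, since that time, improvements in the data-entry utilities for GEO (www.ncbi.nlm.nih.gov/geo), ArrayExpress (www.ebi.ac.uk/arrayexpress), and CIBEX (cibex.nig.ac.jp), as well as the growing number of commercial and academic software packages capable of writing MAGE-ML documents (Spellman et al. 2002) that can be submitted directly to the databases, have lowered the barriers to the point where we should reconsider the requirement. Requiring submission of microarray data to public repositories will provide distinct advantages to the entire research community. (1) These public repositories have an established commitment to continued service, providing a level of assurance that published gene expression data will be available into the future. (2) Having data in standardized formats in public repositories will not only make them more accessible, but will also allow them to be integrated with other relevant data including genome sequences, single nucleotide polymorphism and haplotype mapping information, the literature, and other resources that can aid in the further interpretation of expression patterns. Although many authors provide such links on their websites, those in the public repositories are more likely to be current. (3) Curation will assist authors, reviewers, and journals in assuring compliance with the MIAME requirements. (4) Enhancing the standardization of data formats will enable the development of analysis and integration tools that make it easier for scientists to access, query, and share data. (5) Finally, public repositories can provide access to prepublication data confidentially, facilitating the review process.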
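In the same way that the availability of sequence data has had a profound impact on a wide range of disciplines, requiring that microarray data be deposited in public repositories will of necessity accelerate the rate of discovery. What this proposal requires is a change in the way in which authors and journals approach publication. Both must recognize that making the data available is a requisite for publication and, because MIAME-compliant submission requires time and effort, that this must be factored into publication timelines. And while this may be time consuming and painful at first, the benefits of building a public repository of microarray data far outweigh the initial disadvantages. As always, it is our sincere hope that these suggestions will stimulate discussion within the community and that together we can arrive at a consensus that ensures that microarray data are widely and easily accessible. Finally, we would like to urge NCBI, EBI, and DDBJ to work towards exchanging microarray data.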

References (7)
Paul T Spellman, Michael Miller, Jason Stewart, Charles Troup, Ugis Sarkans, Steve Chervitz, Derek Bernhart, Gavin Sherlock, Catherine Ball, Marc Lepage, Marcin Swiatek, WL Marks, Jason Goncalves, Scott Markel, Daniel Iordan, Mohammadreza Shojatalab, Angel Pizarro, Joe White, Robert Hubley, Eric Deutsch, Martin Senger, Bruce J Aronow, Alan Robinson, Doug Bassett, Christian J Stoeckert, Alvis Brazma. Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biology, vol. 3, pp. 1-9 (2002). doi:10.1186/GB-2002-3-9-RESEARCH0046
Alvis Brazma, Pascal Hingamp, John Quackenbush, Gavin Sherlock, Paul Spellman, Chris Stoeckert, John Aach, Wilhelm Ansorge, Catherine A Ball, Helen C Causton, Terry Gaasterland, Patrick Glenisson, Frank CP Holstege, Irene F Kim, Victor Markowitz, John C Matese, Helen Parkinson, Alan Robinson, Ugis Sarkans, Steffen Schulze-Kremer, Jason Stewart, Ronald Taylor, Jaak Vilo, Martin Vingron. Minimum information about a microarray experiment (MIAME): toward standards for microarray data. Nature Genetics, vol. 29, pp. 365-371 (2001). doi:10.1038/NG1201-365
Ron Edgar, Michael Domrachev, Alex E Lash. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Research, vol. 30, pp. 207-210 (2002). doi:10.1093/NAR/30.1.207
Catherine A Ball, Gavin Sherlock, Helen Parkinson, Philippe Rocca-Serra, Catherine Brooksbank, Helen C Causton, Duccio Cavalieri, Terry Gaasterland, Pascal Hingamp, Frank Holstege, Martin Ringwald, Paul Spellman, Christian J Stoeckert Jr, Jason E Stewart, Ronald Taylor, Alvis Brazma, John Quackenbush, Microarray Gene Expression Data Society. The underlying principles of scientific publication. Bioinformatics, vol. 18, p. 1409 (2002). doi:10.1093/BIOINFORMATICS/18.11.1409
Kazuho Ikeo, Jun Ishi-i, Takurou Tamura, Takashi Gojobori, Yoshio Tateno. CIBEX: center for information biology gene expression database. Comptes Rendus Biologies, vol. 326, pp. 1079-1082 (2003). doi:10.1016/J.CRVI.2003.09.034
Alvis Brazma, Helen Parkinson, Ugis Sarkans, Mohammadreza Shojatalab, Jaak Vilo, Niran Abeygunawardena, Ele Holloway, Misha Kapushesky, Patrick Kemmeren, Gonzalo Garcia Lara, Ahmet Oezcimen, Philippe Rocca-Serra, Susanna-Assunta Sansone. ArrayExpress: a public repository for microarray gene expression data at the EBI. Nucleic Acids Research, vol. 33, pp. D553-D555 (2004). doi:10.1093/NAR/GKI056
Catherine A Ball, Gavin Sherlock, Helen Parkinson, Philippe Rocca-Serra, Catherine Brooksbank, Helen C Causton, Duccio Cavalieri, Terry Gaasterland, Pascal Hingamp, Frank Holstege, Martin Ringwald, Paul Spellman, Christian J Stoeckert, Jason E Stewart, Ronald Taylor, Alvis Brazma, John Quackenbush. Standards for Microarray Data. Science, vol. 298, pp. 539b-539 (2002). doi:10.1126/SCIENCE.298.5593.539B