From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles of Data Models and Workflows in Bioinformatics

作者: Alejandra González-Beltrán , Peter Li , Jun Zhao , Maria Susana Avila-Garcia , Marco Roos

DOI: 10.1371/JOURNAL.PONE.0127612

关键词:

摘要: Motivation Reproducing the results from a scientific paper can be challenging due to absence of data and computational tools required for their analysis. In addition, details relating procedures used obtain published difficult discern use natural language when reporting how experiments have been performed. The Investigation/Study/Assay (ISA), Nanopublications (NP), Research Objects (RO) models are conceptual modelling frameworks that structure such information papers. Computational workflow platforms also reproduce analyses in principled manner. We assessed extent by which ISA, NP, RO models, together with Galaxy system, capture experimental processes findings previously on development SOAPdenovo2, de novo genome assembler. Results Executable workflows were developed using Galaxy, reproduced consistent findings. A structured representation SOAPdenovo2 was produced combining models. By structuring these frameworks, it possible explicitly declare elements design, variables, served as guides curation this led identification inconsistencies original paper, thereby allowing its authors publish corrections form an errata. Availability SOAPdenovo2 scripts, data, available through GigaScience Database: http://dx.doi.org/10.5524/100044; GigaGalaxy: http://galaxy.cbiit.cuhk.edu.hk; representations case study website http://isa-tools.github.io/soapdenovo2/. Contact: philippe.rocca-serra@oerc.ox.ac.uk susanna-assunta.sansone@oerc.ox.ac.uk.

参考文章(56)
Kevin Page, Raúl Palma, Piotr Hoªubowicz, Graham Klyne, Stian Soiland-Reyes, Don Cruickshank, Rafael González Cabero, Esteban García, David De Roure Cuesta, Jun Zhao, José Manuel Gómez-Pérez, From workflows to Research Objects: an architecture for preserving the semantics of science Proceedings of the 2nd International Workshop on Linked Science. 2012;.. ,(2012)
Herman Stehouwer, Research data alliance the 5th African Conference for Digital Scholarship & Curation 2013. ,(2013)
Paul Groth, Andrew Gibson, Jan Velterop, The anatomy of a nanopublication Information services & use. ,vol. 30, pp. 51- 56 ,(2010) , 10.3233/ISU-2010-0613
Massimo Pigliucci, The end of theory in science EMBO Reports. ,vol. 10, pp. 534- 534 ,(2009) , 10.1038/EMBOR.2009.111
Pekka Kohonen, Emilio Benfenati, David Bower, Rebecca Ceder, Michael Crump, Kevin Cross, Roland C. Grafström, Lyn Healy, Christoph Helma, Nina Jeliazkova, Vedrin Jeliazkov, Silvia Maggioni, Scott Miller, Glenn Myatt, Michael Rautenberg, Glyn Stacey, Egon Willighagen, Jeff Wiseman, Barry Hardy, The ToxBank Data Warehouse: Supporting the Replacement of In Vivo Repeated Dose Systemic Toxicity Testing Molecular Informatics. ,vol. 32, pp. 47- 63 ,(2013) , 10.1002/MINF.201200114
C. Glenn Begley, Lee M. Ellis, Drug development: Raise standards for preclinical cancer research Nature. ,vol. 483, pp. 531- 533 ,(2012) , 10.1038/483531A
Alejandra Gonzalez-Beltran, Eamonn Maguire, Pavlos Georgiou, Susanna-Assunta Sansone, Philippe Rocca-Serra, Bio-GraphIIn: a graph-based, integrative and semantically-enabled repository for life science experimental data EMBnet.journal. ,vol. 19, pp. 46- 50 ,(2013) , 10.14806/EJ.19.B.728
Darrel C. Ince, Leslie Hatton, John Graham-Cumming, The case for open computer programs Nature. ,vol. 482, pp. 485- 488 ,(2012) , 10.1038/NATURE10836