A posteriori metadata from automated provenance tracking: integration of AiiDA and TCOD

作者: Andrius Merkys , Nicolas Mounet , Andrea Cepellotti , Nicola Marzari , Saulius Gražulis

DOI: 10.1186/S13321-017-0242-Y

关键词:

摘要: In order to make results of computational scientific research findable, accessible, interoperable and re-usable, it is necessary decorate them with standardised metadata. However, there are a number technical practical challenges that this process difficult achieve in practice. Here the implementation protocol presented tag crystal structures their computed properties, without need human intervention curate data. This leverages capabilities AiiDA, an open-source platform manage automate workflows, TCOD, open-access database storing materials properties using well-defined exhaustive ontology. Based on these, complete procedure deposit data TCOD automated. All relevant metadata extracted from full provenance information AiiDA tracks stores automatically while managing calculations. Such also enables reproducibility field science. As proof concept, AiiDA–TCOD interface used 170 theoretical together graphs, consisting over 4600 nodes.

参考文章(43)
P. R. Mallinson, I. D. Brown, Classification and use of electron density data International Tables for Crystallography. ,(2006) , 10.1107/97809553602060000737
P. M. D. Fitzgerald, J. D. Westbrook, P. E. Bourne, B. McMahon, K. D. Watenpaugh, H. M. Berman, Macromolecular dictionary (mmCIF) International Tables for Crystallography. pp. 295- 443 ,(2006) , 10.1107/97809553602060000745
Luc Moreau, Juliana Freire, Joe Futrelle, Robert E. McGrath, Jim Myers, Patrick Paulson, The Open Provenance Model: An Overview international provenance and annotation workshop. pp. 323- 326 ,(2008) , 10.1007/978-3-540-89965-5_31
N. Freed, N. Borenstein, Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies RFC. ,vol. 2045, pp. 1- 31 ,(1996)
Jürg Hutter, Marcella Iannuzzi, Florian Schiffmann, Joost VandeVondele, cp2k: atomistic simulations of condensed matter systems Wiley Interdisciplinary Reviews: Computational Molecular Science. ,vol. 4, pp. 15- 25 ,(2014) , 10.1002/WCMS.1159
Hareesh Rajan, Hinako Uchida, Deborah L. Bryan, Ranjini Swaminathan, Robert T. Downs, Michelle Hall-Wallace, Building the American Mineralogist Crystal Structure Database: A recipe for construction of a small Internet database Geoinformatics: Data to Knowledge. ,vol. 397, pp. 73- 80 ,(2006) , 10.1130/2006.2397(06)
B. McMahon, General considerations when defining a CIF data item International Tables for Crystallography. pp. 73- 91 ,(2006) , 10.1107/97809553602060000733
Giovanni Pizzi, Andrea Cepellotti, Riccardo Sabatini, Nicola Marzari, Boris Kozinsky, AiiDA: automated interactive infrastructure and database for computational science Computational Materials Science. ,vol. 111, pp. 218- 230 ,(2016) , 10.1016/J.COMMATSCI.2015.09.013
M. Valiev, E.J. Bylaska, N. Govind, K. Kowalski, T.P. Straatsma, H.J.J. Van Dam, D. Wang, J. Nieplocha, E. Apra, T.L. Windus, W.A. de Jong, NWChem: a comprehensive and scalable open-source solution for large scale molecular simulations Computer Physics Communications. ,vol. 181, pp. 1477- 1489 ,(2010) , 10.1016/J.CPC.2010.04.018
James E. Saal, Scott Kirklin, Muratahan Aykol, Bryce Meredig, C. Wolverton, Materials Design and Discovery with High-Throughput Density Functional Theory: The Open Quantum Materials Database (OQMD) JOM. ,vol. 65, pp. 1501- 1509 ,(2013) , 10.1007/S11837-013-0755-4