Challenges as enablers for high quality Linked Data: insights from the Semantic Publishing Challenge

Authors: Anastasia Dimou, Sahar Vahdati, Angelo Di Iorio, Christoph Lange, Ruben Verborgh

DOI: 10.7717/PEERJ-CS.105

Abstract: While most challenges organized so far in the Semantic Web domain are focused on comparing tools with respect to different criteria such as their features and competencies, or exploiting semantically enriched data, the Semantic Web Evaluation Challenges series, co-located with the ESWC Semantic Web Conference, aims to compare them based on their output, namely the produced dataset. The Semantic Publishing Challenge is one of these challenges. Its goal is to involve participants in extracting data from heterogeneous sources on scholarly publications, and producing Linked Data that can be exploited by the community itself. This paper reviews lessons learned from both (i) the overall organization of the Semantic Publishing Challenge, regarding the definition of the tasks, building the input dataset and forming the evaluation, and (ii) the results produced by the participants, regarding the proposed approaches, the used tools, the preferred vocabularies and the results produced in the three editions of 2014, 2015 and 2016. We compared these lessons to other Semantic Web Evaluation Challenges. In this paper, we (i) distill best practices for organizing such challenges that could be applied to similar events, and (ii) report observations on Linked Data publishing derived from the submitted solutions. We conclude that higher quality may be achieved when Linked Data is produced as a result of a challenge, because the competition becomes an incentive, while solutions become better when they are evaluated against the rules of the challenge.
