Authors: Pieter Heyvaert, Anastasia Dimou, Ruben Verborgh, Erik Mannens, Rik Van de Walle
DOI: 10.1007/978-3-319-25518-7_14
Keywords: Computer science, RDF, Task (project management), Semantic publishing, Information retrieval, SPARQL, Set (abstract data type), Quality (business)
Abstract: In this paper, we present our solution for the first task of the second Semantic Publishing Challenge. The task requires extracting and semantically annotating information regarding CEUR-WS workshops, their chairs and conference affiliations, as well as their papers and authors, from a set of HTML-encoded workshop proceedings volumes. Our solution builds on last year’s submission, while we address a number of shortcomings, assess the generated dataset for its quality, and publish the queries as SPARQL query templates. This is accomplished using the RDF Mapping Language (RML) to define the mappings, the RMLProcessor to execute them, RDFUnit to both validate the mapping documents and assess the generated dataset’s quality, and The DataTank to publish the SPARQL query templates. This results in an overall improved quality of the generated dataset that is reflected in the query results.