Automatic Construction of a Semantic Knowledge Base from CEUR Workshop Proceedings

Authors: Bahar Sateli, René Witte

DOI: 10.1007/978-3-319-25518-7_11

Abstract: We present an automatic workflow that performs text segmentation and entity extraction from scientific literature, primarily to address Task 2 of the Semantic Publishing Challenge 2015. The goal of Task 2 is to extract various information from full-text papers to represent the context in which a document is written, such as the affiliation of its authors and the corresponding funding bodies. Our proposed solution is composed of two subsystems: (i) a text mining pipeline, developed based on the GATE framework, which extracts structural and semantic entities, such as authors’ information and references, and produces semantic (typed) annotations; and (ii) a flexible exporting module, the LODeXporter, which translates the document annotations into RDF triples according to custom mapping rules. Additionally, we leverage existing Named Entity Recognition (NER) tools to extract named entities from the text and ground them to their corresponding resources on the Linked Open Data cloud, thus briefly covering the Task 3 objectives, which involve linking detected entities to existing open datasets. The output of our system is an RDF graph stored in a scalable TDB-based storage with a public SPARQL endpoint for the task’s queries.
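
The export step described in the abstract, in which typed annotations are turned into RDF triples according to mapping rules, can be illustrated with a minimal sketch. This is not the LODeXporter implementation: the `Annotation` class, the `RULES` table, the `export_ntriples` function, and the `http://example.org/` IRIs are all hypothetical names chosen for illustration, and the use of FOAF terms for authors is an assumption rather than the vocabulary used in the paper. The sketch also assumes document identifiers are already safe to embed in an IRI.

```python
from dataclasses import dataclass, field

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


@dataclass
class Annotation:
    """A typed annotation, as a text mining pipeline might produce (hypothetical shape)."""
    doc_id: str
    ann_id: int
    ann_type: str
    features: dict = field(default_factory=dict)


# Hypothetical mapping rules: annotation type -> (RDF class, {feature name: predicate}).
RULES = {
    "Author": (
        "http://xmlns.com/foaf/0.1/Person",
        {"name": "http://xmlns.com/foaf/0.1/name"},
    ),
    "Affiliation": (
        "http://example.org/vocab#Affiliation",
        {"name": "http://example.org/vocab#label"},
    ),
}


def escape_literal(value):
    """Escape backslashes, quotes, and line breaks for an N-Triples string literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def export_ntriples(annotations, rules, base="http://example.org/resource/"):
    """Translate annotations into N-Triples lines; types without a rule are skipped."""
    lines = []
    for ann in annotations:
        rule = rules.get(ann.ann_type)
        if rule is None:
            continue  # no mapping rule for this annotation type
        rdf_class, feature_map = rule
        subject = f"<{base}{ann.doc_id}/{ann.ann_type}/{ann.ann_id}>"
        lines.append(f"{subject} <{RDF_TYPE}> <{rdf_class}> .")
        for feature, predicate in feature_map.items():
            if feature in ann.features:
                literal = escape_literal(str(ann.features[feature]))
                lines.append(f'{subject} <{predicate}> "{literal}" .')
    return lines


if __name__ == "__main__":
    sample = [
        Annotation("paper1", 0, "Author", {"name": "Jane Doe"}),
        Annotation("paper1", 1, "Token", {"string": "ignored"}),  # no rule, skipped
    ]
    for line in export_ntriples(sample, RULES):
        print(line)
```

Keeping the rules in a data table rather than in code mirrors the idea of custom mapping rules: changing the target vocabulary means editing `RULES`, not the export function. The resulting N-Triples lines could then be loaded into any triple store that exposes a SPARQL endpoint.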

References (5)
Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, A framework and graphical development environment for robust NLP tools and applications. Meeting of the Association for Computational Linguistics, pp. 168-175 (2002)
Tudor Groza, Siegfried Handschuh, Knud Möller, Stefan Decker, SALT - Semantically Annotated LaTeX for Scientific Publications. European Semantic Web Conference, vol. 4519, pp. 518-532 (2007), 10.1007/978-3-540-72667-8_37
Bahar Sateli, René Witte, Elian Angius, Marie-Jean Meurs, Greg Butler, Justin Powlowski, Adrian Tsang, Supporting Researchers with a Semantic Literature Management Wiki. The 4th Workshop on Semantic Publishing (SePublica 2014), vol. 1155 (2014)
Alexandru Constantin, Silvio Peroni, Steve Pettifer, David Shotton, Fabio Vitali, The Document Components Ontology (DoCO). Semantic Web, vol. 7, pp. 167-181 (2016), 10.3233/SW-150177