MACJa: Metadata and Citations Jailbreaker

作者: Andrea Giovanni Nuzzolese , Silvio Peroni , Diego Reforgiato Recupero

DOI: 10.1007/978-3-319-25518-7_10

关键词:

摘要: This paper presents the Metadata And Citations Jailbreaker (a.k.a. MACJa – IPA /’matsja/), i.e., a method for processing research papers available in CEUR-WS.org and stored as PDF files order to extract relevant semantic data publish them RDF triplestore according Semantic Publishing Referencing (SPAR) Ontologies. In particular, extraction of all information needed addressing queries Challenge 2015 (task 2) is guaranteed by using techniques based on Natural Language Processing (i.e., Combinatory Categorial Grammar, Discourse Representation Theory, Linguistic Frames), Web technologies good Ontology Design practices Content Analysis, Patterns, Referent Extraction Linking, Topic Extraction).

参考文章(28)
Aldo Gangemi, A Comparison of Knowledge Extraction Tools for the Semantic Web extended semantic web conference. ,vol. 7882, pp. 351- 366 ,(2013) , 10.1007/978-3-642-38288-8_24
Aldo Gangemi, Andrea Giovanni Nuzzolese, Valentina Presutti, Francesco Draicchio, Alberto Musetti, Paolo Ciancarini, Automatic typing of DBpedia entities international semantic web conference. pp. 65- 81 ,(2012) , 10.1007/978-3-642-35176-1_5
Anastasia Dimou, Miel Vander Sande, Pieter Colpaert, Laurens De Vocht, Ruben Verborgh, Erik Mannens, Rik Van de Walle, Extraction and Semantic Annotation of Workshop Proceedings in HTML Using RML Communications in Computer and Information Science. ,vol. 475, pp. 114- 119 ,(2014) , 10.1007/978-3-319-12024-9_15
Mathieu d'Aquin, Sofia Angeletou, Laurian Gridinoc, Marta Sabou, Claudio Baldassarre, Enrico Motta, Watson: supporting next generation semantic web applications ,(2007)
Hans Kamp, A Theory of Truth and Semantic Representation Methods in the Study of Language Representation. pp. 329- 369 ,(2008) , 10.1163/9789004252882_014
Valentina Presutti, Francesco Draicchio, Aldo Gangemi, Knowledge extraction based on discourse representation theory and linguistic frames knowledge acquisition, modeling and management. ,vol. 7603, pp. 114- 129 ,(2012) , 10.1007/978-3-642-33876-2_12
Pieter Colpaert, Anastasia Dimou, Erik Mannens, Miel Vander Sande, Rik Van De Walle, Extending R2RML to a source-independent mapping language for RDF international semantic web conference. pp. 237- 240 ,(2013)
Silvio Peroni, David Shotton, FaBiO and CiTO Journal of Web Semantics. ,vol. 17, pp. 33- 43 ,(2012) , 10.1016/J.WEBSEM.2012.08.001
Dominika Tkaczyk, Pawel Szostek, Piotr Jan Dendek, Mateusz Fedoryszak, Lukasz Bolikowski, CERMINE -- Automatic Extraction of Metadata and References from Scientific Literature document analysis systems. pp. 217- 221 ,(2014) , 10.1109/DAS.2014.63