Authors: Andrea Giovanni Nuzzolese, Silvio Peroni, Diego Reforgiato Recupero
DOI: 10.1007/978-3-319-25518-7_10
Keywords:
Abstract: This paper presents the Metadata And Citations Jailbreaker (a.k.a. MACJa – IPA /’matsja/), i.e., a method for processing research papers available in CEUR-WS.org and stored as PDF files in order to extract relevant semantic data and publish them in an RDF triplestore according to the Semantic Publishing and Referencing (SPAR) Ontologies. In particular, the extraction of all the information needed for addressing the queries of the Semantic Publishing Challenge 2015 (task 2) is guaranteed by using techniques based on Natural Language Processing (i.e., Combinatory Categorial Grammar, Discourse Representation Theory, Linguistic Frames), Semantic Web technologies and good Ontology Design practices (i.e., Content Analysis, Ontology Design Patterns, Discourse Referent Extraction and Linking, Topic Extraction).