作者: Marc Bertin , Iana Atanassova
DOI: 10.1007/978-3-319-12024-9_16
关键词:
摘要: We propose a hybrid method for the extraction and characterization of citations in scientific papers using machine learning combined with rule-based approaches. Our protocol consists metadata, bibliography parsing, section titles processing, find-grained semantic annotation on sentence level texts. This allows us to generate Linked Open Data from set research XML.