Semantic Rule Filtering for Web-Scale Relation Extraction

作者: Andrea Moro , Hong Li , Sebastian Krause , Feiyu Xu , Roberto Navigli

DOI: 10.1007/978-3-642-41335-3_22

关键词: Natural language processingArtificial intelligenceSemantic compressionSemantic networkSemantic similarityRelation (database)SemanticsSet (abstract data type)Semantic computingComputer scienceInformation retrievalRelationship extraction

摘要: Web-scale relation extraction is a means for building and extending large repositories of formalized knowledge. This type automated knowledge requires decent level precision, which hard to achieve with automatically acquired rule sets learned from unlabeled data by distant or minimal supervision. paper shows how precision can be considerably improved employing wide-coverage, general-purpose lexical semantic network, i.e., BabelNet, effective filtering. We apply Word Sense Disambiguation the content words extracted rules. As result set relation-specific relevant concepts obtained, each these then used represent structured semantics corresponding relation. The resulting subgraphs BabelNet are as filters estimating adequacy For seven relations tested here, filter consistently yields higher at any relative recall value in high-recall range.

参考文章(65)
Mihai Surdeanu, Massimiliano Ciaramita, Robust Information Extraction with Perceptrons ,(2007)
Eduard Hovy, Zornitsa Kozareva, A Semi-Supervised Method to Learn and Construct Taxonomies Using the Web empirical methods in natural language processing. pp. 1110- 1118 ,(2010)
Eugene Agichtein, Confidence Estimation Methods for Partially Supervised Information Extraction. siam international conference on data mining. pp. 539- 543 ,(2006)
Günter Neumann, Alexander Volokh, 372:Comparing the Benefit of Different Dependency Parsers for Textual Entailment Using Syntactic Constraints Only meeting of the association for computational linguistics. pp. 308- 312 ,(2010)
Oren Etzioni, Alexander Yates, Unsupervised Resolution of Objects and Relations on the Web north american chapter of the association for computational linguistics. pp. 121- 130 ,(2007)
Christian Chiarcos, Sebastian Nordhoff, Sebastian Hellmann, Linked data in linguistics : representing and connecting language data and language metadata Springer. ,(2012)
Sergey Brin, Extracting Patterns and Relations from the World Wide Web Lecture Notes in Computer Science. pp. 172- 183 ,(1999) , 10.1007/10704656_11
Michael J. Cafarella, Oren Etzioni, Stephen Soderland, Michele Banko, Matt Broadhead, Open information extraction from the web international joint conference on artificial intelligence. pp. 2670- 2676 ,(2007)
Hans Uszkoreit, Learning Relation Extraction Grammars with Minimal Human Intervention: Strategy, Results, Insights and Plans Computational Linguistics and Intelligent Text Processing. pp. 106- 126 ,(2011) , 10.1007/978-3-642-19437-5_9