Modeling relations and their mentions without labeled text

Authors: Sebastian Riedel, Limin Yao, Andrew McCallum

DOI: 10.1007/978-3-642-15939-8_10

Abstract: Several recent works on relation extraction have been applying the distant supervision paradigm: instead of relying on annotated text to learn how to predict relations, they employ existing knowledge bases (KBs) as source of supervision. Crucially, these approaches are trained based on the assumption that each sentence which mentions the two related entities is an expression of the given relation. Here we argue that this leads to noisy patterns that hurt precision, in particular if the knowledge base is not directly related to the text we are …
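
The distant supervision assumption described in the abstract can be illustrated with a minimal sketch. The toy knowledge base, the sentences, and the function name `distant_labels` are all hypothetical and chosen only for illustration; this is not the authors' system, and real pipelines would use entity linking rather than plain substring matching.

```python
# Hypothetical toy knowledge base: (entity1, entity2) -> relation.
KB = {
    ("Barack Obama", "Honolulu"): "born_in",
}

# Hypothetical toy corpus.
SENTENCES = [
    "Barack Obama was born in Honolulu in 1961.",
    "Barack Obama visited Honolulu last week.",
    "Honolulu is the capital of Hawaii.",
]


def distant_labels(kb, sentences):
    """Label every sentence that mentions both entities of a KB pair.

    This encodes the assumption the abstract criticizes: any sentence
    containing both entities is treated as expressing the KB relation.
    """
    labeled = []
    for (e1, e2), relation in kb.items():
        for sentence in sentences:
            # Naive substring match stands in for entity mention detection.
            if e1 in sentence and e2 in sentence:
                labeled.append((sentence, e1, e2, relation))
    return labeled


labeled = distant_labels(KB, SENTENCES)
for sentence, e1, e2, relation in labeled:
    print(f"{relation}({e1}, {e2}) <- {sentence}")
```

In this toy corpus the second sentence mentions both entities but does not express `born_in`, yet it receives the same label as the first. This is the kind of noisy pattern the abstract says can hurt precision.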

References (29)
Sebastian Riedel, Andrew McCallum, Limin Yao, Sameer Singh. Constraint-Driven Rank-Based Learning for Information Extraction. North American Chapter of the Association for Computational Linguistics, pp. 729-732 (2010).
Sameer Singh, Karl Schultz, Andrew McCallum. Bi-directional Joint Inference for Entity Resolution and Segmentation Using Imperatively-Defined Factor Graphs. European Conference on Machine Learning, vol. 5782, pp. 414-429 (2009). DOI: 10.1007/978-3-642-04174-7_27
Mark Craven, Johan Kumlien. Constructing Biological Knowledge Bases by Extracting Information from Text Sources. Intelligent Systems in Molecular Biology, pp. 77-86 (1999).
Kedar Bellare, Andrew McCallum. Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment. Empirical Methods in Natural Language Processing, pp. 131-140 (2009). DOI: 10.3115/1699510.1699528
Aron Culotta, Andrew McCallum. Joint deduplication of multiple record types in relational data. Proceedings of the 14th ACM international conference on Information and knowledge management - CIKM '05, pp. 257-258 (2005). DOI: 10.1145/1099554.1099615
Dmitry Zelenko, Chinatsu Aone, Anthony Richardella. Kernel methods for relation extraction. Journal of Machine Learning Research, vol. 3, pp. 1083-1106 (2003). DOI: 10.1162/153244303322533205
Stuart Geman, Donald Geman. Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-6, pp. 721-741 (1984). DOI: 10.1109/TPAMI.1984.4767596
Claus S. Jensen, Uffe Kjærulff, Augustine Kong. Blocking Gibbs sampling in very large probabilistic expert systems. International Journal of Human-computer Studies / International Journal of Man-machine Studies, vol. 42, pp. 647-666 (1995). DOI: 10.1006/IJHC.1995.1029
Kurt Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, Jamie Taylor. Freebase. Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08, pp. 1247-1250 (2008). DOI: 10.1145/1376616.1376746