Improving Entity Linking using Surface Form Refinement

作者: Eric Charton , Marie-Jean Meurs , Ludovic Jean-Louis , Michel Gagnon

DOI:

关键词:

摘要: In this paper, we present an algorithm for improving named entity resolution and linking by using surface form generation rewriting. Surface forms consist of a word or group words that matches lexical units like Paris New York City. Used as matching sequences to select candidate entries in knowledge base, they contribute the disambiguation those candidates through similarity measures. context, misspelled textual (entities) can be impossible identify due lack available forms. To address problem, propose refinement based on Wikipedia resources. The approach extends coverage our system, rewrites reformulates mentions prior starting annotation process. is evaluated corpus associated with monolingual English task NIST KBP 2013. We show improves system performance.

参考文章(22)
W. W. Cohen and P. Ravikumar and S. Fienberg, A Comparison of String Metrics for Matching Names and Records ,(2003)
Ilaria Bordino, Manfred Pinkal, Stefan Thater, Gerhard Weikum, Johannes Hoffart, Marc Spaniol, Hagen Fürstenau, Mohamed Amir Yosef, Bilyana Taneva, Robust Disambiguation of Named Entities in Text empirical methods in natural language processing. pp. 782- 792 ,(2011)
AJ Lait, B Randell, An Assessment of Name Matching Algorithms Department of Computing Science Technical Report Series. ,(1996)
Juan-Manuel Torres-Moreno, Eric Charton, NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems language resources and evaluation. ,(2010)
Aldo Gangemi, A Comparison of Knowledge Extraction Tools for the Semantic Web extended semantic web conference. ,vol. 7882, pp. 351- 366 ,(2013) , 10.1007/978-3-642-38288-8_24
Jun'ichi Kazama, Kentaro Torisawa, Exploiting Wikipedia as External Knowledge for Named Entity Recognition empirical methods in natural language processing. pp. 698- 707 ,(2007)
Razvan C. Bunescu, Marius Pasca, Using Encyclopedic Knowledge for Named Entity Disambiguation conference of the european chapter of the association for computational linguistics. ,(2006)
Borislav Popov, Atanas Kiryakov, Angel Kirilov, Dimitar Manov, Damyan Ognyanoff, Miroslav Goranov, KIM: semantic annotation platform international semantic web conference. pp. 834- 849 ,(2003) , 10.1007/978-3-540-39718-2_53
Robert A. Wagner, Michael J. Fischer, The String-to-String Correction Problem Journal of the ACM. ,vol. 21, pp. 168- 173 ,(1974) , 10.1145/321796.321811
Eric Charton, Frederic Bechet, Unsupervised knowledge acquisition for Extracting Named Entities from speech international conference on acoustics, speech, and signal processing. pp. 5338- 5341 ,(2010) , 10.1109/ICASSP.2010.5494962