Entity Mention Detection using a Combination of Redundancy-Driven Classifiers

作者: Roberto Zanoli , Manuela Speranza , Silvana Marianela Bernaola Biggio

DOI:

关键词:

摘要: We present an experimental framework for Entity Mention Detection in which two different classifiers are combined to exploit Data Redundancy attained through the annotation of a large text corpus, as well number Patterns extracted automatically from same corpus. In order recognize proper name, nominal, and pronominal mentions we not only information given by recognized within corpus being annotated, but also occurring external unannotated The system was first evaluated Evalita 2009 evaluation campaign obtaining good results. current version is used applications: on one hand, it LiveMemories project, aims at scaling up content extraction techniques towards very scale multimedia sources. On other annotate corpora, such Italian Wikipedia, thus providing easy access syntactic semantic both Natural Language Processing Information Retrieval communities. Moreover web service available going be integrated into TextPro suite NLP tools.

参考文章(14)
Claire Grover, Marc Moens, Andrei Mikheev, Description of the LTG system used for MUC-7 Seventh Message Understanding Conference (MUC-7): Proceedings of a Conference Held in Fairfax, Virginia, April 29 - May 1, 1998. ,(1998)
Emanuele Pianta, Roberto Zanoli, Christian Girardi, The TextPro Tool Suite language resources and evaluation. ,(2008)
Yorick Wilks, Valentin Tablan, Diana Maynard, Cristian Ursu, Named Entity Recognition from Diverse Text Types ,(2001)
Ralph Grishman, Andrew Eliot Borthwick, A maximum entropy approach to named entity recognition Ph. D. Thesis New York University. ,(1999)
Alberto Lavelli, Joakim Nivre, Johan Hall, Jens Nilsson, MaltParser at the EVALITA 2009 Dependency Parsing Task ,(2009)
Xavier Carreras, Lluís Màrquez, Lluís Padró, Named Entity Extraction using AdaBoost international conference on computational linguistics. pp. 1- 4 ,(2002) , 10.3115/1118853.1118857
James Mayfield, Paul McNamee, Christine Piatko, Named entity recognition using hundreds of thousands of features north american chapter of the association for computational linguistics. pp. 184- 187 ,(2003) , 10.3115/1119176.1119205
Montserrat Arévalo Rodríguez, Montserrat Civit Torruella, Maria Antònia Martí, MICE: a module for Named Entities Recognition and Classification International Journal of Corpus Linguistics. ,vol. 9, pp. 53- 68 ,(2004) , 10.1075/IJCL.9.1.03ARE
Radu Florian, Abe Ittycheriah, Hongyan Jing, Tong Zhang, Named entity recognition through classifier combination north american chapter of the association for computational linguistics. pp. 168- 171 ,(2003) , 10.3115/1119176.1119201
Hai Leong Chieu, Hwee Tou Ng, Named entity recognition with a maximum entropy approach north american chapter of the association for computational linguistics. pp. 160- 163 ,(2003) , 10.3115/1119176.1119199