Ontology Population from Textual Mentions: Task Definition and Benchmark

作者: Bernardo Magnini , Emanuele Pianta , Manuela Speranza , Octavian Popescu

DOI:

关键词:

摘要: In this paper we propose and investigate Ontology Population from Textual Mentions (OPTM), a sub-task of text where assume that mentions for several kinds entities (e.g. PERSON, ORGANIZATION , LOCATION GEOPOLITICAL_ ENTITY) are already extracted document collection. On the one hand, OPTM simplifies general task, limiting input textual material; on other it introduces challenging extensions to restricted named entities, being open wider spectrum linguistic phenomena. We describe manually created benchmark discuss factors which determine difficulty task.

参考文章(10)
Abdulrahman Almuhareb, Massimo Poesio, Attribute-Based and Value-Based Clustering: An Evaluation. empirical methods in natural language processing. pp. 158- 165 ,(2004)
P. Buitelaar, P. Cimiano, B. Magnini, Ontology Learning from Text: Methods, Evaluation and Applications ,(2005)
Hamish Cunningham, Kalina Bontcheva, Yaoyong Li, Knowledge management and human language: crossing the chasm Journal of Knowledge Management. ,vol. 9, pp. 108- 131 ,(2005) , 10.1108/13673270510622492
Henri Avancini, Alberto Lavelli, Bernardo Magnini, Fabrizio Sebastiani, Roberto Zanoli, Expanding domain-specific lexicons by term categorization Proceedings of the 2003 ACM symposium on Applied computing - SAC '03. pp. 793- 797 ,(2003) , 10.1145/952532.952690
Dekang Lin, Automatic Retrieval and Clustering of Similar Words meeting of the association for computational linguistics. pp. 768- 774 ,(1998) , 10.3115/980691.980696
Bernardo Magnini, Hristo Tanev, Weakly Supervised Approaches for Ontology Population conference of the european chapter of the association for computational linguistics. ,(2006)
Inderjeet Mani, Laurie Gerber, Lisa Ferro, George Wilson, Beth Sundheim, 2003 Standard for the Annotation of Temporal Expressions ,(2004)
Treebank Penn, Linguistic Data Consortium ,(1999)
Bernardo Magnini, V. Bartalesi, Emanuele Pianta, Rachele Sprugnoli, Lorenza Romano, Christian Girardi, Manuela Speranza, Matteo Negri, I-CAB: the Italian Content Annotation Bank language resources and evaluation. pp. 963- 968 ,(2006)
B. Magnini, E. Pianta, R. Sprugnoli, M. Negri, M. Speranza, A. Lavelli, Italian Content Annotation Bank (I-CAB): Temporal Expressions (v. 1.0) ,(2005)