Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations

作者: Patrick Pantel , Marco Pennacchiotti

DOI: 10.3115/1220175.1220190

关键词:

摘要: In this paper, we present Espresso, a weakly-supervised, general-purpose, and accurate algorithm for harvesting semantic relations. The main contributions are: i) method exploiting generic patterns by filtering incorrect instances using the Web; ii) principled measure of pattern instance reliability enabling algorithm. We an empirical comparison Espresso with various state art systems, on different size genre corpora, extracting general specific Experimental results show that our exploitation substantially increases system recall small effect overall precision.

参考文章(23)
Deepak Ravichandran, Patrick Pantel, Automatically Labeling Semantic Classes north american chapter of the association for computational linguistics. pp. 321- 328 ,(2004)
Hristo Tanev, Ido Dagan, Idan Szpektor, Bonaventura Coppola, Scaling Web-based Acquisition of Entailment Relations. empirical methods in natural language processing. pp. 41- 48 ,(2004)
Sanda M. Harabagiu, Marius Pa_ca, Shauna Eggers, The Informative Role of WordNet in Open-Domain Question Answering ,(2004)
Doug Downey, Oren Etzioni, Stephen Soderland, A probabilistic model of redundancy in information extraction international joint conference on artificial intelligence. pp. 1034- 1041 ,(2005) , 10.21236/ADA454763
Ellen Riloff, Jessica Shepherd, A Corpus-Based Approach for Building Semantic Lexicons. empirical methods in natural language processing. ,(1997)
Adam Kilgarriff, Christiane Fellbaum, WordNet : an electronic lexical database Language. ,vol. 76, pp. 706- ,(2000) , 10.2307/417141
David Day, John Aberdeen, Lynette Hirschman, Robyn Kozierok, Patricia Robinson, Marc Vilain, Mixed-Initiative Development of Language Processing Systems conference on applied natural language processing. pp. 348- 355 ,(1997) , 10.3115/974557.974608
Dekang Lin, Patrick Pantel, Concept discovery from text Proceedings of the 19th international conference on Computational linguistics -. pp. 1- 7 ,(2002) , 10.3115/1072228.1072372
Marti A. Hearst, Automatic acquisition of hyponyms from large text corpora Proceedings of the 14th conference on Computational linguistics -. pp. 539- 545 ,(1992) , 10.3115/992133.992154