Knowledge-based WSD on specific domains: performing better than generic supervised WSD

Authors: Eneko Agirre, Aitor Soroa, Oier Lopez de Lacalle

DOI:

Keywords: WordNet, Artificial intelligence, Graph (abstract data type), Word-sense disambiguation, Natural language processing, Computer science

Abstract: This paper explores the application of knowledge-based Word Sense Disambiguation (WSD) systems to specific domains, based on our state-of-the-art graph-based WSD system that uses the information in WordNet. Evaluation was performed over a publicly available domain-specific dataset of 41 words related to Sports and Finance, comprising examples drawn from three corpora: one balanced corpus (BNC) and two domain-specific corpora (news related to Sports and Finance). The results show that on all three corpora our knowledge-based WSD algorithm improves on previous results, and also on supervised WSD systems trained on SemCor, the largest annotated corpus. We also show that using related words as context, instead of the actual occurrence contexts, yields better results on the domain datasets, but not on the general one. Interestingly, the results are higher for the domain-specific corpora than for the general corpus, raising prospects for improving current WSD systems when applied to specific domains.


