Knowledge-based WSD on specific domains: performing better than generic supervised WSD

作者: Eneko Agirre , Aitor Soroa , Oier Lopez De Lacalle

DOI:

关键词: WordNetArtificial intelligenceGraph (abstract data type)Word-sense disambiguationNatural language processingComputer science

摘要: This paper explores the application of knowledge-based Word Sense Disambiguation systems to specific domains, based on our state-of-the-art graph-based WSD system that uses information in WordNet. Evaluation was performed over a publicly available domain-specific dataset 41 words related Sports and Finance, comprising examples drawn from three corpora: one balanced corpus (BNC), two corpora (news Finance). The results show all algorithm improves previous results, also supervised trained SemCor, largest annotated corpus. We using as context, instead actual occurrence contexts, yields better domain datasets, but not general one. Interestingly, are higher for than corpus, raising prospects improving current when applied domains.

参考文章(15)
Benjamin Snyder, Martha Palmer, The English all-words task meeting of the association for computational linguistics. pp. 41- 43 ,(2004)
Roberto Navigli, Mirella Lapata, Graph connectivity measures for unsupervised word sense disambiguation international joint conference on artificial intelligence. pp. 1683- 1688 ,(2007)
Gerard Escudero, Lluís Màrquez, German Rigau, An Empirical Study of the Domain Dependence of Supervised Word Disambiguation Systems empirical methods in natural language processing. pp. 172- 180 ,(2000) , 10.3115/1117794.1117816
Eneko Agirre, Oier Lopez de Lacalle, Supervised Domain Adaption for WSD meeting of the association for computational linguistics. pp. 42- 50 ,(2009) , 10.3115/1609067.1609071
Sameer Pradhan, Edward Loper, Dmitriy Dligach, Martha Palmer, None, SemEval-2007 Task-17: English Lexical Sample, SRL and All Words meeting of the association for computational linguistics. pp. 87- 92 ,(2007) , 10.3115/1621474.1621490
Eneko Agirre, Aitor Soroa, Personalizing PageRank for Word Sense Disambiguation meeting of the association for computational linguistics. pp. 33- 41 ,(2009) , 10.3115/1609067.1609070
Eneko Agirre, Oier Lopez de Lacalle, On Robustness and Domain Adaptation using SVD for Word Sense Disambiguation international conference on computational linguistics. pp. 17- 24 ,(2008) , 10.3115/1599081.1599084
Rob Koeling, Diana McCarthy, John Carroll, Domain-specific sense distributions and predominant sense acquisition Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05. pp. 419- 426 ,(2005) , 10.3115/1220575.1220628
George A. Miller, Claudia Leacock, Randee Tengi, Ross T. Bunker, A semantic concordance Proceedings of the workshop on Human Language Technology - HLT '93. pp. 303- 308 ,(1993) , 10.3115/1075671.1075742
Sergey Brin, Lawrence Page, The anatomy of a large-scale hypertextual Web search engine the web conference. ,vol. 30, pp. 107- 117 ,(1998) , 10.1016/S0169-7552(98)00110-X