Word Sense Induction & Disambiguation Using Hierarchical Random Graphs

作者: Ioannis Klapaftis , Suresh Manandhar

DOI:

关键词:

摘要: Graph-based methods have gained attention in many areas of Natural Language Processing (NLP) including Word Sense Disambiguation (WSD), text summarization, keyword extraction and others. Most the work these formulate their problem a graph-based setting apply unsupervised graph clustering to obtain set clusters. Recent studies suggest that graphs often exhibit hierarchical structure goes beyond simple flat clustering. This paper presents an method for inferring grouping senses polysemous word. The inferred structures are applied word sense disambiguation, where we show our performs significantly better than traditional agglomerative yielding improvements over state-of-the-art WSD systems based on induction.

参考文章(27)
Ioannis P. Klapaftis, Suresh Manandhar, Word Sense Induction Using Graphs of Collocations european conference on artificial intelligence. pp. 298- 302 ,(2008)
Ioannis Klapaftis, Suresh Manandhar, Sameer Pradhan, Dmitriy Dligach, SemEval-2010 Task 14: Word Sense Induction & Disambiguation meeting of the association for computational linguistics. pp. 63- 68 ,(2010)
Benjamin Snyder, Martha Palmer, The English all-words task meeting of the association for computational linguistics. pp. 41- 43 ,(2004)
G. T. Barkema, M. E. J. Newman, Monte Carlo methods in statistical physics ,(1979)
Ioannis P. Klapaftis, Suresh Manandhar, Taxonomy Learning Using Word Sense Induction north american chapter of the association for computational linguistics. pp. 82- 90 ,(2010)
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Julie Weeds, David Weir, Diana McCarthy, Characterising measures of lexical distributional similarity Proceedings of the 20th international conference on Computational Linguistics - COLING '04. pp. 1015- 1021 ,(2004) , 10.3115/1220355.1220501
Samuel Brody, Mirella Lapata, Bayesian Word Sense Induction meeting of the association for computational linguistics. pp. 103- 111 ,(2009) , 10.3115/1609067.1609078
Eneko Agirre, Aitor Soroa, SemEval-2007 Task 02: Evaluating Word Sense Induction and Discrimination Systems meeting of the association for computational linguistics. pp. 7- 12 ,(2007) , 10.3115/1621474.1621476
Daniel Ramage, Anna N. Rafferty, Christopher D. Manning, Random Walks for Text Semantic Similarity graph based methods for natural language processing. pp. 23- 31 ,(2009) , 10.3115/1708124.1708131