Knowledge lean word sense disambiguation

作者: Ted Pedersen

DOI:

关键词:

摘要: We present a corpus-based approach to word-sense disambiguation that only requires information can be automatically extracted from untagged text. use unsupervised techniques estimate the parameters of model describing conditional distribution sense group given known contextual features. Both EM algorithm and Gibbs Sampling are evaluated determine which is most appropriate for our data. compare their accuracy in an experiment with thirteen different words three feature sets. results small but consistent improvement over algorithm.

参考文章(19)
Rebecca Bruce, Ted Pedersen, A new supervised learning algorithm for word sense disambiguation national conference on artificial intelligence. pp. 604- 609 ,(1997)
Hwee Tou Ng, Exemplar-Based Word Sense Disambiguation” Some Recent Improvements empirical methods in natural language processing. ,(1997)
Rebecca F. Bruce, Ted Pedersen, Distinguishing Word Senses in Untagged Text empirical methods in natural language processing. ,(1997)
Raymond J. Mooney, Comparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning empirical methods in natural language processing. ,(1996)
William A. Gale, Kenneth W. Church, David Yarowsky, Discrimination Decisions for 100,000-Dimensional Spaces Current Issues in Computational Linguistics: In Honour of Don Walker. ,vol. 55, pp. 429- 450 ,(1994) , 10.1007/978-0-585-35958-8_22
William A. Gale, Kenneth W. Church, David Yarowsky, A method for disambiguating word senses in a large corpus Computers and The Humanities. ,vol. 26, pp. 415- 439 ,(1992) , 10.1007/BF00136984
Claudia Leacock, Geoffrey Towell, Ellen Voorhees, Corpus-based statistical sense resolution Proceedings of the workshop on Human Language Technology - HLT '93. pp. 260- 265 ,(1993) , 10.3115/1075671.1075730
Stuart Geman, Donald Geman, Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-6, pp. 721- 741 ,(1984) , 10.1109/TPAMI.1984.4767596
Xiao-Li Meng, David Van Dyk, None, The EM Algorithm-an Old Folk-song Sung to a Fast New Tune Journal of the Royal Statistical Society: Series B (Statistical Methodology). ,vol. 59, pp. 511- 567 ,(1997) , 10.1111/1467-9868.00082
A. P. Dempster, N. M. Laird, D. B. Rubin, Maximum Likelihood from Incomplete Data Via theEMAlgorithm Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 39, pp. 1- 22 ,(1977) , 10.1111/J.2517-6161.1977.TB01600.X