Machine Learning for Natural Language Processing

作者: Martin Emms

DOI:

关键词:

摘要: Over the past decade Machine Learning techniques has become an essentialtool for Natural Language Processing. This introductory course will coverthe basics of and present a selection widely used al-gorithms, illustrating them with practical applications to LanguageProcessing. The start survey main concepts inMachine Learning, in terms decisions one needs make whendesigning application, including: type training (su-pervised, unsupervised, active learning etc), data representation, choice andrepresentation target function, algorithm. befollowed by case studies designed illustrate theseconcepts. Unsupervised (clustering) be describedand illustrated through tasks such as thesaurus induction,document class inference, term extraction text classification. Super-vised covering symbolic (e.g. decision trees) non-symbolic approaches probabilistic classifiers, instance-based classifiers,support vector machines) presented on clas-sification word sense disambiguation analysed some detail. Finally,we address issue functions which assign struc-tures linear sequences Hidden Markov Modelsto part-of-speech tagging Probabilistic Grammars parsing naturallanguage.

参考文章(67)
Christopher D. Manning, Dan Klein, The unsupervised learning of natural language structure Stanford University. ,(2005)
Saturnino Luz, Daniel Kelleher, Automatic hypertext keyphrase detection international joint conference on artificial intelligence. pp. 1608- 1609 ,(2005)
Jun'ichi Tsujii, Yusuke Miyao, Jun'ichi Kazama, A Maximum Entropy Tagger with Unsupervised Hidden Markov Models. NLPRS. pp. 333- 340 ,(2001)
Eugene Charniak, Glenn Carroll, Two Experiments on Learning Probabilistic Dependency Grammars from Corpora Brown University. ,(1992)
Michael J. Pazzani, Pedro M. Domingos, Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier. international conference on machine learning. pp. 105- 112 ,(1996)
Masahiko Haruno, Satoshi Shirai, Yoshifumi Ooyama, Using Decision Trees to Construct a Practical Parser Machine Learning. ,vol. 34, pp. 131- 149 ,(1999) , 10.1023/A:1007597902467
Q.I. Wang, D. Schuurmans, Improved estimation for unsupervised part-of-speech tagging international conference natural language processing. pp. 219- 224 ,(2005) , 10.1109/NLPKE.2005.1598738
Kamal Nigam, Andrew McCallum, A comparison of event models for naive bayes text classification national conference on artificial intelligence. pp. 41- 48 ,(1998)
Jakub Zavrel, Peter Berck, Steven Gillis, Walter Daelemans, MBT: A Memory-Based Part of Speech Tagger-Generator international conference on computational linguistics. pp. 14- 27 ,(1996)
Eugene Charniak, Statistical parsing with a context-free grammar and word statistics national conference on artificial intelligence. pp. 598- 603 ,(1997)