Semi-supervised sequence modeling with syntactic topic models

Authors: Andrew McCallum, Wei Li

Abstract: Although there has been significant previous work on semi-supervised learning for classification, there has been relatively little in sequence modeling. This paper presents an approach that leverages recent work in manifold learning on sequences to discover word clusters from language data, including both syntactic classes and semantic topics. From unlabeled data we form a smooth, low-dimensional feature space, where each word token is projected based on its underlying role as a function or content word. We then use this projection as additional input features to a linear-chain conditional random field trained on limited labeled training data. On standard part-of-speech tagging and Chinese word segmentation data sets, we show as much as 14% error reduction due to the unlabeled data, and also statistically significant improvements over a related semi-supervised sequence tagging method due to Miller et al.

References (13)
Scott Miller, Jethran Guinness, Alex Zamanian, Name Tagging with Word Clusters and Discriminative Training. North American Chapter of the Association for Computational Linguistics, pp. 337-342 (2004)
David M. Blei, Andrew Y. Ng, Michael I. Jordan, Latent Dirichlet Allocation. Journal of Machine Learning Research, vol. 3, pp. 993-1022 (2003), doi:10.5555/944919.944937
Richard Sproat, Thomas Emerson, The First International Chinese Word Segmentation Bakeoff. Proceedings of the Second SIGHAN Workshop on Chinese Language Processing, pp. 133-143 (2003), doi:10.3115/1119250.1119269
Richard H. Byrd, Jorge Nocedal, Robert B. Schnabel, Representations of quasi-Newton matrices and their use in limited memory methods. Mathematical Programming, vol. 63, pp. 129-156 (1994), doi:10.1007/BF01582063
Pat Langley, Editorial: On Machine Learning. Machine Learning, vol. 1, pp. 5-10 (1986), doi:10.1023/A:1022687019898
Kamal Nigam, Andrew Kachites McCallum, Sebastian Thrun, Tom Mitchell, Text Classification from Labeled and Unlabeled Documents using EM. Machine Learning, vol. 39, pp. 103-134 (2000), doi:10.1023/A:1007692713085
Thomas L. Griffiths, Mark Steyvers, David M. Blei, Joshua B. Tenenbaum, Integrating Topics and Syntax. Neural Information Processing Systems, vol. 17, pp. 537-544 (2004)
Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai, Class-based n-gram models of natural language. Computational Linguistics, vol. 18, pp. 467-479 (1992), doi:10.5555/176313.176316
Xiaojin Zhu, Zoubin Ghahramani, John D. Lafferty, Semi-supervised learning using Gaussian fields and harmonic functions. International Conference on Machine Learning, pp. 912-919 (2003)