Linear Co-occurrence Rate Networks (L-CRNs) for Sequence Labeling

Authors: Zhemin Zhu, Djoerd Hiemstra, Peter Apers

DOI: 10.1007/978-3-319-11397-5_14

Abstract: Sequence labeling has wide applications in natural language processing and speech processing. Popular sequence labeling models suffer from some known problems: hidden Markov models (HMMs) are generative and cannot encode transition features; conditional Markov models (CMMs) suffer from the label bias problem; and training conditional random fields (CRFs) can be expensive. In this paper, we propose Linear Co-occurrence Rate Networks (L-CRNs) for sequence labeling, which avoid the mentioned problems with existing models. The factors of L-CRNs can be locally normalized and trained separately, which leads to a simple and efficient training method. Experimental results on real-world data sets show that L-CRNs reduce training time by orders of magnitude while achieving results very competitive with CRFs.
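
The abstract does not define the co-occurrence rate. The sketch below assumes the common definition CR(x; y) = p(x, y) / (p(x) p(y)), which is the exponential of pointwise mutual information. Under that assumption it checks numerically that a first-order chain's joint probability equals the product of per-position label marginals times the co-occurrence rates of adjacent label pairs, so each factor is a locally normalized quantity. The toy labels, probabilities, and function names are hypothetical, and the sketch leaves out the conditioning on observations that a sequence labeler would need. It is an illustration of the factorization idea, not the authors' implementation.

```python
from itertools import product

# Hypothetical toy chain: two labels, three positions.
labels = ["A", "B"]
init = {"A": 0.6, "B": 0.4}              # p(s_1)
trans = {"A": {"A": 0.7, "B": 0.3},      # p(s_{i+1} | s_i)
         "B": {"A": 0.2, "B": 0.8}}
n = 3


def chain_joint(seq):
    """Joint probability from the usual chain-rule factorization."""
    p = init[seq[0]]
    for a, b in zip(seq, seq[1:]):
        p *= trans[a][b]
    return p


# Per-position label marginals p_i(s), obtained by forward propagation.
unigram = [dict(init)]
for _ in range(n - 1):
    prev = unigram[-1]
    unigram.append({b: sum(prev[a] * trans[a][b] for a in labels)
                    for b in labels})


def co_occurrence_rate(i, a, b):
    """Assumed definition: CR = p_i(a, b) / (p_i(a) * p_{i+1}(b))."""
    pair = unigram[i][a] * trans[a][b]   # p_i(a, b)
    return pair / (unigram[i][a] * unigram[i + 1][b])


def cr_joint(seq):
    """Joint probability as unigram marginals times adjacent-pair CRs."""
    p = 1.0
    for i, s in enumerate(seq):
        p *= unigram[i][s]
    for i, (a, b) in enumerate(zip(seq, seq[1:])):
        p *= co_occurrence_rate(i, a, b)
    return p


# Largest disagreement between the two factorizations over all sequences.
max_gap = max(abs(chain_joint(s) - cr_joint(s))
              for s in product(labels, repeat=n))
```

Because every factor here is built from ordinary marginal probabilities, each could in principle be estimated on its own, which is the property the abstract credits for the simple training method.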
