The Best of Two Worlds: Cooperation of Statistical and Rule-Based Taggers for Czech

Authors: Drahomíra "johanka" Spoustová, Jan Hajič, Jan Votrubec, Pavel Krbec, Pavel Květoň

DOI: 10.3115/1567545.1567558

Abstract: Several hybrid disambiguation methods are described which combine the strengths of hand-written rules and statistical taggers. Three different statistical taggers (HMM, Maximum-Entropy, and Averaged Perceptron) are used in a tagging experiment on the Prague Dependency Treebank. The results of the hybrid systems are better than those of any other method tried for Czech so far.

References (20)
Lars Borin, Something Borrowed, Something Blue: Rule-Based Combination of POS Taggers. Language Resources and Evaluation, pp. 21-26 (2000)
Jakub Zavrel, Peter Berck, Steven Gillis, Walter Daelemans, MBT: A Memory-Based Part of Speech Tagger-Generator. International Conference on Computational Linguistics, pp. 14-27 (1996)
Karel Oliva, Milena Hnátková, Vladimír Petkevič, Pavel Květoň, The Linguistic Basis of a Rule-Based Tagger of Czech. Text, Speech and Dialogue, pp. 3-8 (2000), 10.1007/3-540-45323-7_1
Adwait Ratnaparkhi, A Maximum Entropy Model for Part-Of-Speech Tagging. Empirical Methods in Natural Language Processing (1996)
Jan Hajič, Morphological tagging: data vs. dictionaries. North American Chapter of the Association for Computational Linguistics, pp. 94-101 (2000)
O. Morgenthaler, Proceedings of the Conference. Bee World, vol. 11, pp. 49-50 (1930), 10.1080/0005772X.1930.11092929
Kimmo Koskenniemi, Finite-state parsing and disambiguation. Proceedings of the 13th Conference on Computational Linguistics, pp. 229-232 (1990), 10.3115/997939.997979
Noah A. Smith, David A. Smith, Roy W. Tromble, Context-based morphological disambiguation with random fields. Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT '05), pp. 475-482 (2005), 10.3115/1220575.1220635