Two-step translation with grammatical post-processing

作者: Rudolf Rosa , David Mareċek , OndÅ™ej Bojar , Petra Galušċáková

DOI:

关键词:

摘要: This paper describes an experiment in which we try to automatically correct mistakes grammatical agreement English Czech MT outputs. We perform several rule-based corrections on sentences parsed dependency trees. prove that it is possible improve the quality of majority systems participating WMT shared task. made both automatic (BLEU) and manual evaluations.

参考文章(11)
Alena Böhmová, Jan Hajič, Eva Hajičová, Barbora Hladká, The Prague Dependency Treebank Treebanks. pp. 103- 127 ,(2003) , 10.1007/978-94-010-0201-1_7
Ondřej Bojar, Zdeněk Žabokrtský, CzEng 0.9: Large Parallel Treebank with Rich Annotation The Prague Bulletin of Mathematical Linguistics. ,vol. 92, pp. 63- 84 ,(2009) , 10.2478/V10108-009-0022-6
Drahomíra "johanka" Spoustová, Jan Hajič, Jan Votrubec, Pavel Krbec, Pavel Květoň, The Best of Two Worlds: Cooperation of Statistical and Rule-Based Taggers for Czech meeting of the association for computational linguistics. pp. 67- 74 ,(2007) , 10.3115/1567545.1567558
Václav Novák, Zdeněk Žabokrtský, Feature engineering in maximum spanning tree dependency parser text speech and dialogue. pp. 92- 98 ,(2007) , 10.1007/978-3-540-74628-7_14
Ondřej Bojar, Kamil Kos, 2010 Failures in English-Czech Phrase-Based MT workshop on statistical machine translation. pp. 60- 66 ,(2010)
Ryan McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajič, Non-projective dependency parsing using spanning tree algorithms Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT '05. pp. 523- 530 ,(2005) , 10.3115/1220575.1220641
Cyril Goutte, Michel Simard, Pierre Isabelle, Statistical Phrase-Based Post-Editing north american chapter of the association for computational linguistics. pp. 508- 515 ,(2007)
Martin Popel, Zdeněk Žabokrtský, TectoMT: modular NLP framework international conference natural language processing. pp. 293- 304 ,(2010) , 10.1007/978-3-642-14770-8_33
Franz Josef Och, Hermann Ney, A systematic comparison of various statistical alignment models Computational Linguistics. ,vol. 29, pp. 19- 51 ,(2003) , 10.1162/089120103321337421
András Kornai, Péter Halácsy, Dániel Varga, Tron Viktor, Nemeth Laszlo, Nagy Viktor, Nagy Laszlo, Parallel corpora for medium density languages John Benjamins Publishing Company. pp. 247- 258 ,(2007)