Using a Large Monolingual Corpus to Improve Translation Accuracy

作者: Radu Soricut , Kevin Knight , Daniel Marcu

DOI: 10.1007/3-540-45820-4_16

关键词:

摘要: The existence of a phrase in large monolingual corpus is very useful information, and so its frequency. We introduce an alternative approach to automatic translation phrases/sentences that operationalizes this observation. use statistical machine system produce translations (re)rank these translations. Our results show combination yields better translations, especially when translating out-of-domain phrases/sentences. can be also used automatically construct parallel corpora from resources.

参考文章(6)
Vincent J. Della Pietra, Stephen A. Della Pietra, Robert L. Mercer, Peter F. Brown, The mathematics of statistical machine translation: parameter estimation Computational Linguistics. ,vol. 19, pp. 263- 311 ,(1993)
Kevin Knight, Yaser Al-Onaizan, Translation with Finite-State Devices conference of the association for machine translation in the americas. pp. 421- 437 ,(1998) , 10.1007/3-540-49478-2_38
Ulrich Germann, Michael Jahr, Kevin Knight, Daniel Marcu, Kenji Yamada, Fast decoding and optimal decoding for machine translation Proceedings of the 39th Annual Meeting on Association for Computational Linguistics - ACL '01. pp. 228- 235 ,(2001) , 10.3115/1073012.1073042