Machine Translation Quality Estimation: Applications and Future Perspectives

作者: Lucia Specia , Kashif Shah

DOI: 10.1007/978-3-319-91241-7_10

关键词:

摘要: Predicting the quality of machine translation (MT) output is a topic that has been attracting significant attention. By automatically distinguishing bad from good translations, it potential to make MT more useful in number applications. In this chapter we review various practical applications where estimation (QE) at sentence level shown positive results: filtering low cases post-editing, selecting best system when multiple options are available, improving performance by additional parallel data, and sampling for assurance humans. Finally, discuss QE other levels (word document) general challenges field, as well perspectives novel directions

参考文章(37)
Vamshi Ambati, Jaime G. Carbonell, Stephan Vogel, Active Learning and Crowd-Sourcing for Machine Translation language resources and evaluation. pp. 2169- 2174 ,(2010)
Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, Prem Natarajan, None, Discriminative Sample Selection for Statistical Machine Translation empirical methods in natural language processing. pp. 626- 635 ,(2010)
V. I. Levenshtein, Binary codes capable of correcting deletions, insertions, and reversals Soviet physics. Doklady. ,vol. 10, pp. 707- 710 ,(1966)
Christopher K I Williams, Carl Edward Rasmussen, Gaussian Processes for Machine Learning ,(2005)
Kashif Shah, Trevor Cohn, Lucia Specia, A Bayesian non-linear method for feature selection in machine translation quality estimation Machine Translation. ,vol. 29, pp. 101- 125 ,(2015) , 10.1007/S10590-014-9164-X
Lucia Specia, Dhwaj Raj, Marco Turchi, Machine translation evaluation versus quality estimation Machine Translation. ,vol. 24, pp. 39- 50 ,(2010) , 10.1007/S10590-010-9077-2
Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu, BLEU Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02. pp. 311- 318 ,(2001) , 10.3115/1073083.1073135
Abdessamad Echihabi, Radu Soricut, TrustRank: Inducing Trust in Automatic Translations via Ranking meeting of the association for computational linguistics. pp. 612- 621 ,(2010)
Gholamreza Haffari, Maxim Roy, Anoop Sarkar, Active Learning for Statistical Phrase-based Machine Translation north american chapter of the association for computational linguistics. pp. 415- 423 ,(2009) , 10.3115/1620754.1620815
Philipp Koehn, Richard Zens, Chris Dyer, Ondřej Bojar, Alexandra Constantin, Evan Herbst, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Moses: Open Source Toolkit for Statistical Machine Translation meeting of the association for computational linguistics. pp. 177- 180 ,(2007) , 10.3115/1557769.1557821