Preference grammars and decoding algorithms for probabilistic synchronous context free grammar based translation

作者: Stephan Vogel , Ashish Venugopal

DOI:

关键词: Context-free grammarDecoding methodsTerminal and nonterminal symbolsRule-based machine translationSynchronous context-free grammarAlgorithmNatural language processingComputer scienceArtificial intelligenceProbabilistic logicPhraseLanguage model

摘要: Probabilistic Synchronous Context-free Grammars (PSCFGs) [Aho and Ullmann, 1969, Wu, 1996] define weighted transduction rules to represent translation reordering operations. When models use features that are defined locally, on each rule, there efficient dynamic programming algorithms perform with these grammars [Kasami, 1965]. In general, the integration of non-local into model can make NP-hard, requiring decoding approximations limit impact features. In this thesis, we consider interaction between two features, n-gram language (LM) labels rule nonterminal symbols in Syntax-Augmented MT (SAMT) grammar [Zollmann Venugopal, 2006]. While do not result NP-hard search, they would lead serious increases wall-clock runtime if naive methods applied. We develop novel two-pass strong during a first pass generating hypergraph sentence spanning derivations. second pass, knowledge about explore for alternative, potentially better translations. We approach integrate LM feature as well syntactic described below. then systematic comparison approaches evaluate relative PSCFG over phrase-based baseline focus labels. This addresses important questions effectiveness variety resource conditions. learn pairs exhibit long distance reordering, deliver improvements comparable systems SAMT additional small, but consistent even conjunction LMs. Finally, propose by extending formalism hard label constraints soft preferences. These preferences used compute new reflects probability derivation is syntactically formed. mitigates effect commonly applied maximum posteriori (MAP) approximation be discriminatively trained concert other report modest quality Chinese-to-English task.

参考文章(87)
Dragomir R. Radev, Sanjeev Khudanpur, Daniel Gildea, Katherine Eng, Alexander M. Fraser, Shankar Kumar, Anoop Sarkar, Zhen Jin, Libin Shen, Franz Josef Och, Kenji Yamada, David Smith, Viren Jain, A Smorgasbord of Features for Statistical Machine Translation north american chapter of the association for computational linguistics. pp. 161- 168 ,(2004)
Alex Waibel, Stephan Vogel, Alicia Tribble, Bing Zhao, Ashish Venugopal, Ying Zhang, Fei Huang, The CMU Statistical Machine Translation System ,(2003)
Philipp Koehn, Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models conference of the association for machine translation in the americas. pp. 115- 124 ,(2004) , 10.1007/978-3-540-30194-3_13
Christoph Tillmann, Franz Josef Och, Hermann Ney, Improved Alignment Models for Statistical Machine Translation empirical methods in natural language processing. ,(1999)
Khalil Sima'an, Computational complexity of probabilistic disambiguation Grammars. ,vol. 5, pp. 125- 151 ,(2002) , 10.1023/A:1016340700671
Dan Klein, Christopher D. Manning, Parsing and hypergraphs New developments in parsing technology. pp. 351- 372 ,(2004) , 10.1007/1-4020-2295-6_18
Eugene Charniak, A maximum-entropy-inspired parser north american chapter of the association for computational linguistics. pp. 132- 139 ,(2000)
S Pallottino, G Gallo, G Longo, S Nguyen, DIRECTED HYPERGRAPHS AND APPLICATIONS CENTRE DE RECHERCHE SUR LES TRANSPORTS PUBLICATION. ,(1992)
Michel Galley, Mark Hopkins, Kevin Knight, Daniel Marcu, What’s in a translation rule? north american chapter of the association for computational linguistics. pp. 273- 280 ,(2004) , 10.21236/ADA460212