作者: Jianfeng Gao , Jian-Yun Nie , Endong Xun , Jian Zhang , Ming Zhou
关键词: Noun phrase 、 Synchronous context-free grammar 、 Evaluation of machine translation 、 Artificial intelligence 、 Machine translation software usability 、 Machine translation 、 Natural language processing 、 Transfer-based machine translation 、 Rule-based machine translation 、 Information retrieval 、 Example-based machine translation 、 Computer science 、 Phrase 、 Computer-assisted translation 、 Cross-language information retrieval 、 Query expansion
摘要: Dictionaries have often been used for query translation in cross-language information retrieval (CLIR). However, we are faced with the problem of ambiguity, i.e. multiple translations stored a dictionary word. In addition, word-by-word is not precise enough. this paper, explore several methods to improve previous dictionary-based translation. First, as many possible, noun phrases recognized and translated whole by using statistical models phrase patterns. Second, best word selected based on cohesion words. Our experimental results TREC English-Chinese CLIR collection show that these techniques result significant improvements over simple approaches, achieve even better performance than high-quality machine system.