Integrating CAT and MT in AnglaBharti-II architecture

Author: R. Mahesh K. Sinha

Abstract: Machine translation (MT) is a complex and difficult task. It is not possible to achieve performance that competes with human translators given the present state of technology. Automating the process of translating natural languages requires a number of knowledge sources and their appropriate invocation in the translation engine. A practical machine translation system with limited resources cannot embody all the knowledge sources that human beings use. However, the performance of an MT system can be considerably improved if the automated process is integrated with supporting modules that provide synergy for arriving at correct translations. Computer-assisted translation (CAT) tools, which identify the limitations of the MT system and provide clues for coping with them, constitute an important module for enhancing its performance. This paper presents details of the AnglaBharti-II architecture, highlighting the role of CAT in the system. AnglaBharti-II is a system for translating English to Indian languages. AnglaBharti is primarily a rule-based machine translation (RBMT) system. The input sentence is transformed into a pseudo-interlingual structure called PLIL (Pseudo Lingua for Indian Languages) using a CFG-like, pattern-directed rule base. RBMT has its own limitations in dealing with real-life texts. We have tried to overcome some of these limitations by integrating additional modules: translation memory, raw and generalized example bases, interactive pre-editing, paraphrasing, failure analysis, and heuristics that attempt to deal with the variety of constructs frequently encountered in real-life text.
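The abstract names the modules but does not say how they are ordered or invoked. As a rough illustration only, the Python sketch below assumes one plausible arrangement: an exact translation-memory lookup first, then the rule-based engine, then a failure-analysis step that hands a clue to the human editor. The names `Result`, `translate`, `memory`, and `toy_rules` are hypothetical and do not come from the paper, and the outputs are placeholder strings, not real translations.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Result:
    text: Optional[str]  # translation, or None if every stage failed
    source: str          # name of the stage that produced the result
    notes: List[str] = field(default_factory=list)  # clues for a human editor


def translate(sentence: str,
              memory: Dict[str, str],
              rule_engine: Callable[[str], Optional[str]]) -> Result:
    """Try each stage in turn (the ordering is an assumption)."""
    key = sentence.strip().lower()
    # Stage 1: exact translation-memory hit.
    if key in memory:
        return Result(memory[key], "translation_memory")
    # Stage 2: rule-based engine; returns None when no rule pattern matches.
    output = rule_engine(sentence)
    if output is not None:
        return Result(output, "rule_based")
    # Stage 3: failure analysis; flag the sentence for interactive pre-editing.
    return Result(None, "failed",
                  ["no rule matched; consider pre-editing or paraphrasing"])


# Toy stand-ins with placeholder outputs (not real translations).
memory = {"good morning": "TM:good morning"}


def toy_rules(sentence: str) -> Optional[str]:
    # Pretend the rule base only covers sentences that end with a period.
    return "RB:" + sentence if sentence.endswith(".") else None


if __name__ == "__main__":
    for s in ["Good morning", "It rains.", "???"]:
        print(s, "->", translate(s, memory, toy_rules))
```

A real system would also consult the example bases and the paraphrasing module; they are left out here to keep the fallback structure visible.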
