作者: Hideki Isozaki , Hajime Tsukada , Taro Watanabe , Jun Suzuki
DOI:
关键词: Evaluation of machine translation 、 Set (abstract data type) 、 Machine translation 、 Rule-based machine translation 、 Computer science 、 Translation (geometry) 、 Pattern recognition 、 Artificial intelligence 、 Margin (machine learning)
摘要: We achieved a state of the art performance in statistical machine translation by using large number features with an online large-margin training algorithm. The millions parameters were tuned only on small development set consisting less than 1K sentences. Experiments Arabic-toEnglish indicated that model trained sparse binary outperformed conventional SMT system features.