Reordering with Source Language Collocations

Zhanyi Liu1,  Haifeng Wang2,  Hua Wu2,  Ting Liu1,  Sheng Li1
1Harbin Institute of Technology, 2Baidu Inc.


Abstract

This paper proposes a novel reordering model for statistical machine translation (SMT) by means of modeling the translation orders of the source language collocations. The model is learned from a word-aligned bilingual corpus where the collocated words in source sentences are automatically detected. During decoding, the model is employed to softly constrain the translation orders of the source language collocations, so as to constrain the translation orders of those source phrases containing these collocated words. The experimental results show that the proposed method significantly improves the translation quality, achieving the absolute improvements of 1.1~1.4 BLEU score over the baseline methods.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-1104.pdf