Exploitation of machine learning techniques in modelling phrase movements for machine translation

Ni, Yizhao, Saunders, Craig, Szedmak, Sandor and Niranjan, Mahesan (2011) Exploitation of machine learning techniques in modelling phrase movements for machine translation. Journal of Machine Learning Research, 12, 1-30.

Record type: Article

Abstract

We propose a distance phrase reordering model (DPR) for statistical machine translation (SMT), where the aim is to learn the grammatical rules and context dependent changes using a phrase reordering classification framework. We consider a variety of machine learning techniques, including state-of-the-art structured prediction methods. Techniques are compared and evaluated on a Chinese-English corpus, a language pair known for the high reordering characteristics which cannot be adequately captured with current models. In the reordering classification task, the method significantly outperforms the baseline against which it was tested, and further, when integrated as a component of the state-of-the-art machine translation system, MOSES, it achieves improvement in translation results.

Text

__userfiles.soton.ac.uk_Users_nsc_mydesktop_272421.pdf - Version of Record

Download (1MB)