Source reordering using MaxEnt classifiers and supertags

Authors
Publication date 2010
Host editors
  • F. Yvon
  • V. Hansen
Book title Proceedings of the 14th Annual Conference of the European Association for Machine Translation (EAMT'10)
Event 14th Annual Conference of the European Association for Machine Translation (EAMT'10), Saint-Raphaƫl, France
Pages (from-to) 292-299
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
Source language reordering can be seen as the preprocessing task of permuting the order of the source words in such a way that the resulting permutation allows as monotone a translation process as possible. We explore a simple but effective source reordering algorithm that works as a cascade of source string transforms, each consisting of swapping the positions of a single pair of adjacent words in order to unfold a candidate pair of crossing alignments. The decision to swap a pair of words is modelled as a binary classification task formulated as a log-linear model and trained under maximum entropy (MaxEnt). We experiment with features that consist of the local neighborhood of both words as well as lexico-syntactic representations known as supertags. Our experiments on the English-to-Dutch EuroParl translation task show that the cascaded alignment unfolding slightly improves the performance of a state-of-the-art phrase translation system that uses distance-based and lexicalized block-oriented reordering.
Document type Conference contribution
Language English
Published at http://www.mt-archive.info/EAMT-2010-Khalilov.pdf
Permalink to this page
Back