Decoding by dynamic chunking for statistical machine translation

Authors
Publication date 2009
Host editors
  • L. Gerber
  • P. Isabelle
  • R. Kuhn
  • N. Bemish
  • M. Dillinger
  • M.-J. Goulet
Book title MT Summit XII: Proceedings of the twelfth Machine Translation Summit, Ottawa, Ontario, Canada
Event 12th Machine Translation Summit (MT Summit XII), Ottawa, Ontario, Canada
Pages (from-to) 160-167
Publisher International Association for Machine Translation (IAMT)
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In this paper we present an extension of a phrase-based decoder that dynamically chunks, reorders, and applies phrase translations in tandem. A maximum entropy classifier is trained based on the word alignments to find the best positions to chunk the source sentence. No language specific or syntactic information is used to build the chunking classifier. Words inside the chunks are moved together to enable the decoder to make long-distance re-orderings to capture the word order differences between languages with different sentence structures. To keep the search space manageable, phrases inside the chunks are monotonically translated, thus by eliminating the unnecessary local re-orderings, it is possible to perform long-distance re-orderings beyond the common fixed distortion limit. Experiments on German to English translation are reported.
Document type Conference contribution
Language English
Published at http://www.mt-archive.info/MTS-2009-Yahyaei.pdf
Permalink to this page
Back