Decoding by dynamic chunking for statistical machine translation
| Authors |
|
|---|---|
| Publication date | 2009 |
| Host editors |
|
| Book title | MT Summit XII: Proceedings of the twelfth Machine Translation Summit, Ottawa, Ontario, Canada |
| Event | 12th Machine Translation Summit (MT Summit XII), Ottawa, Ontario, Canada |
| Pages (from-to) | 160-167 |
| Publisher | International Association for Machine Translation (IAMT) |
| Organisations |
|
| Abstract |
In this paper we present an extension of a phrase-based decoder that dynamically chunks, reorders, and applies phrase translations in tandem. A maximum entropy classifier is trained based on the word alignments to find the best positions to chunk the source sentence. No language specific or syntactic information is used to build the chunking classifier. Words inside the chunks are moved together to enable the decoder to make long-distance re-orderings to capture the word order differences between languages with different sentence structures. To keep the search space manageable, phrases inside the chunks are monotonically translated, thus by eliminating the unnecessary local re-orderings, it is possible to perform long-distance re-orderings beyond the common fixed distortion limit. Experiments on German to English translation are reported.
|
| Document type | Conference contribution |
| Language | English |
| Published at | http://www.mt-archive.info/MTS-2009-Yahyaei.pdf |
| Permalink to this page | |