- Author
- Year
- 2014
- Title
- Dependency-Based Bilingual Language Models for Reordering in Statistical Machine Translation
- Event
- 2014 Empirical Methods in Natural Language Processing (EMNLP)
- Book/source title
- EMNLP 2014: the 2014 Conference on Empirical Methods in Natural Language Processing: proceedings of the conference: October 25-29, 2014, Doha, Qatar
- Pages (from-to)
- 1689-1700
- Publisher
- Stroudsburg, PA: Association for Computational Linguistics
- ISBN
- 9781937284961
- Document type
- Conference contribution
- Faculty
- Faculty of Science (FNWI)
- Institute
- Informatics Institute (IVI)
- Abstract
-
This paper presents a novel approach to improve reordering in phrase-based machine translation by using richer, syntactic representations of units of bilingual language models (BiLMs). Our method to include syntactic information is simple in implementation and requires minimal changes in the decoding algorithm. The approach is evaluated in a series of Arabic-English and Chinese-English translation experiments. The best models demonstrate significant improvements in BLEU and TER over the phrase-based baseline, as well as over the lexicalized BiLM by Niehues et al. (2011). Further improvments of up to 0.45 BLEU for Arabic-English and up to 0.59 BLEU for Chinese-English are obtained by combining our dependency BiLM with a lexicalized BiLM. An improvement of 0.98 BLEU is obtained for Chinese-English in the setting of an increased distortion limit.
- Link
- Link
- Language
- English
- Permalink
- http://hdl.handle.net/11245/1.439254
Disclaimer/Complaints regulations
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.