University of AmsterdamUniversity of AmsterdamUvA

  • Terms of use
  • Contact

UvA-DARE (Digital Academic Repository)

  • Home
  • Advanced Search
  • Browse
  • My selection

Search UvA-DARE

Author
E. Garmash
Title
Exploring the correspondence between languages for machine translation
Supervisors
M. de Rijke
Co-supervisors
C. Monz
Award date
12 December 2017
Number of pages
137
ISBN
978-94-6182-853-8
Document type
PhD thesis
Faculty
Faculty of Science (FNWI)
Institute
Informatics Institute (IVI)
Abstract
The central topic of this thesis is the exploration of properties that are common across languages and properties that differentiate them, in the context of machine translation. The field of machine translation aims to render the content expressed in one language into another language. Therefore, the difficulty of this task is proportional to how different the given two languages are in expressing the same information. In the first part of the thesis, we focus on properties common across languages. We aim to validate the hypothesis that every language has a level of representation which is shared, to some extent, by all languages. This level of representation is the syntactic structure of a sentence. Consequently, one can narrow down the search for translation correspondences between units of any two languages. We realize these ideas in the form of bilingual syntactic language models which are used as soft constraints during the translation process. The models provide improvements in translation quality in a series of rescoring and decoding experiments. In the second part of the thesis, we use the idea that despite the observation that all natural languages share certain ways of expressing content, there are also many features that differentiate them. We hypothesize that the differences between languages are systematic. We design ensembles of neural machine translation systems sharing the target language but differing in their source languages. We perform a comparison to monolingual ensembles obtained by initializing systems with different random seeds and observe systematically better performance for the multi-source ensembles.
Permalink
http://hdl.handle.net/11245.1/2637f76e-951e-40c4-aa25-b51f05c1eba7
Downloads
  • Thesis

Disclaimer/Complaints regulations

If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.

PrintPrint this pageShareShare via emailShare on facebookShare on linkedinShare on twitter
  • University library
  • About UvA-DARE
  • Disclaimer
Copyright UvA 2014