- Hierarchical Alignment Decomposition Labels for Hiero Grammar Rules
- Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-7)
- Book/source title
- Proceedings of SSST-7 : Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation
- Book/source subtitle
- SIGMT/SIGLEX Workshop : The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
- Pages (from-to)
- Stroudsburg, PA: The Association for Computational Linguistics
- Document type
- Conference contribution
- Interfacultary Research Institutes
- Institute for Logic, Language and Computation (ILLC)
Selecting a set of nonterminals for the synchronous CFGs underlying the hierarchical phrase-based models is usually done on the basis of a monolingual resource (like a syntactic parser).
However, a standard bilingual resource like word alignments is itself rich with reordering patterns that, if clustered somehow, might provide labels of different (possibly complementary) nature to monolingual labels. In this paper we explore a first version of this idea based on a hierarchical decomposition of word alignments into recursive tree representations.
We identify five clusters of alignment patterns in which the children of a node in a decomposition tree are found and employ these five as nonterminal labels for the Hiero productions. Although this is our first non-optimized instantiation of the idea, our experiments show competitive performance with the Hiero baseline, exemplifying certain merits of this novel approach.
- Final publisher version
If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.