A Formal Characterization of Parsing Word Alignments by Synchronous Grammars with Empirical Evidence to the ITG Hypothesis

G. Maillette de Buij Wenniger; K. Sima'an

A Formal Characterization of Parsing Word Alignments by Synchronous Grammars with Empirical Evidence to the ITG Hypothesis

Authors	G. Maillette de Buij Wenniger K. Sima'an
Publication date	2013
Host editors	M. Carpuat L. Specia D. Wu
Book title	Proceedings of SSST-7 : Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation
Book subtitle	SIGMT/SIGLEX Workshop : The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
ISBN	9781937284473
Event	Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-7)
Pages (from-to)	58-67
Publisher	Stroudsburg, PA: The Association for Computational Linguistics
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	Deciding whether a synchronous grammar formalism generates a given word alignment (the alignment coverage problem) depends on finding an adequate instance grammar and then using it to parse the word alignment. But what does it mean to parse a word alignment by a synchronous grammar? This is formally undefined until we define an unambiguous mapping between grammatical derivations and word-level alignments. This paper proposes an initial, formal characterization of alignment coverage as intersecting two partially ordered sets (graphs) of translation equivalence units, one derived by a grammar instance and another defined by the word alignment. As a first sanity check, we report extensive coverage results for ITG on automatic and manual alignments. Even for the ITG formalism, our formal characterization makes explicit many algorithmic choices often left underspecified in earlier work.
Document type	Conference contribution
Language	English
Published at	http://aclweb.org/anthology/W/W13/W13-0807.pdf
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

A Formal Characterization of Parsing Word Alignments by Synchronous Grammars with Empirical Evidence to the ITG Hypothesis