Efficient parsing with linear context-free rewriting systems

Open Access
Authors
Publication date 2012
Host editors
  • W. Daelemans
Book title EACL 2012: 13th Conference of the European Chapter of the Association for Computational Linguistics
Book subtitle proceedings of the conference : April 23-27 2012, Avignon France
ISBN
  • 9781937284190
Event EACL 2012: 13th Conference of the European Chapter of the Association for Computational Linguistics
Pages (from-to) 460-470
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Faculty of Science (FNWI)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
Previous work on treebank parsing with discontinuous constituents using Linear Context-Free Rewriting systems (LCFRS) has been limited to sentences of up to 30 words, for reasons of computational complexity. There have been some results on binarizing an LCFRS in a manner that minimizes parsing complexity, but the present work shows that parsing long sentences with such an optimally binarized grammar remains infeasible. Instead, we introduce a technique which removes this length restriction, while maintaining a respectable accuracy. The resulting parser has been applied to a discontinuous treebank with favorable results.
Document type Conference contribution
Language English
Published at http://andreasvc.github.io/eacl2012corrected.pdf http://www.aclweb.org/anthology/E/E12/E12-1047.pdf https://dl.acm.org/citation.cfm?id=2380873
Downloads
eacl2012corrected (Accepted author manuscript)
E12-1047 (Final published version)
Permalink to this page
Back