Testing the Processing Hypothesis of word order variation using a probabilistic language model

Open Access
Authors
Publication date 2016
Host editors
  • D. Brunato
  • F. Dell'Orletta
  • G. Venturi
  • T. François
  • P. Blache
Book title CL4LC 2016 : Computational Linguistics for Linguistic Complexity
Book subtitle proceedings of he workshop : December 11, 2016, Osaka, Japan
ISBN (electronic)
  • 9784879747099
Event Computational Linguistics for Linguistic Complexity
Pages (from-to) 174-185
Publisher The COLING 2016 Organizing Committee
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract
This work investigates the application of a measure of surprisal to modeling a grammatical variation phenomenon between near-synonymous constructions. We investigate a particular variation phenomenon, word order variation in Dutch two-verb clusters, where it has been established that word order choice is affected by processing cost. Several multifactorial corpus studies of Dutch verb clusters have used other measures of processing complexity to show that this factor affects word order choice. This previous work allows us to compare the surprisal measure, which is based on constraint satisfaction theories of language modeling, to those previously used measures, which are more directly linked to empirical observations of processing complexity. Our results show that surprisal does not predict the word order choice by itself, but is a significant predictor when used in a measure of uniform information density (UID). This lends support to the view that human language processing is facilitated not so much by predictable sequences of words but more by sequences of words in which information is spread evenly.
Document type Conference contribution
Language English
Published at https://www.aclweb.org/anthology/W16-4120/
Downloads
W16-4120 (Final published version)
Permalink to this page
Back