Exploring the relation between semantic complexity and quantifier distribution in large corpora
| Authors |
|
|---|---|
| Publication date | 03-2017 |
| Journal | Language Sciences |
| Volume | Issue number | 60 |
| Pages (from-to) | 80–93 |
| Organisations |
|
| Abstract |
In this paper we study if semantic complexity can influence the distribution of generalized quantifiers in a large English corpus derived from Wikipedia. We consider the minimal computational device recognizing a generalized quantifier as the core measure of its semantic complexity. We regard quantifiers that belong to three increasingly more complex classes: Aristotelian (recognizable by 2-state acyclic finite automata), counting (k+2-state finite automata), and proportional quantifiers (pushdown automata). Using regression analysis we show that semantic complexity is a statistically significant factor explaining 27.29% of frequency variation. We compare this impact to that of other known sources of complexity, both semantic (quantifier monotonicity and the comparative/superlative distinction) and superficial (e.g., the length of quantifier surface forms). In general, we observe that the more complex a quantifier, the less frequent it is.
|
| Document type | Article |
| Note | In special issue: Complexity in human languages: A multifaceted approach |
| Language | English |
| Published at | https://doi.org/10.1016/j.langsci.2017.01.006 |
| Permalink to this page | |
