Towards a mathematical model of word class clusterings

Open Access
Authors
Publication date 2008
Journal Linguistics in Amsterdam
Volume | Issue number 1 | 1
Pages (from-to) 5-35
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract
Croft (2001) argues that distributional analysis of word classes is doomed to failure because there is no way to know when to stop splitting word classes into subclasses. This paper discusses mathematical clustering algorithms and shows that contrary to Croft's assumption there exist hard and fast criteria to know when to stop splitting. The method exposed is applied to a subset of English lexemes first proposed by Crystal (1967). Finally, the clustering properties of typologically diverse languages are discussed in the light of the clustering model and checked against current theories of parts-of-speech. The paper concludes by affirming that clusterings can be established for any language but cannot be equated with the classical notion of parts-of-speech.

Document type Article
Published at http://saraswati.ic.uva.nl/cgi/t/text/text-idx?c=aclc;sid=043e5c35f54757192bb216a85ab7b320;idno=m0101a02;view=header
Downloads
Permalink to this page
Back