Towards a mathematical model of word class clusterings

Authors	S. Nordhoff
Publication date	2008
Journal	Linguistics in Amsterdam
Volume | Issue number	1 | 1
Pages (from-to)	5-35
Organisations	Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract	Croft (2001) argues that distributional analysis of word classes is doomed to failure because there is no way to know when to stop splitting word classes into subclasses. This paper discusses mathematical clustering algorithms and shows that, contrary to Croft's assumption, there are hard and fast criteria for knowing when to stop splitting. The method presented is applied to a subset of English lexemes first proposed by Crystal (1967). Finally, the clustering properties of typologically diverse languages are discussed in the light of the clustering model and checked against current theories of parts-of-speech. The paper concludes that clusterings can be established for any language but cannot be equated with the classical notion of parts-of-speech.
Document type	Article
Published at	http://saraswati.ic.uva.nl/cgi/t/text/text-idx?c=aclc;sid=043e5c35f54757192bb216a85ab7b320;idno=m0101a02;view=header
Downloads	298074.pdf

UvA-DARE