Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions

Open Access
Authors
Publication date 2012
Host editors
  • N. Calzolari
  • K. Choukri
  • T. Declerck
  • M. Uğur Doğan
  • B. Maegaard
  • J. Mariani
  • J. Odijk
  • S. Piperidis
Book title Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Book subtitle 23-25 May, 2012, Istanbul, Turkey
ISBN
  • 9782951740877
Event 8th International Conference on Language Resources and Evaluation (LREC'12)
Pages (from-to) 1511-1515
Publisher Paris: European Language Resources Association (ELRA)
Organisations
  • Faculty of Humanities (FGw)
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Science (FNWI)
Abstract
Natural languages possess a wealth of indefinite forms that typically differ in distribution and interpretation. Although formal semanticists have strived to develop precise meaning representations for different indefinite functions, to date there has hardly been any corpus work on the topic. In this paper, we present the results of a small corpus study where English indefinite forms any and some were labelled with fine-grained semantic functions well-motivated by typological studies. We developed annotation guidelines that could be used by non-expert annotators and calculated inter-annotator agreement amongst several coders. The results show that the annotation task is hard, with agreement scores ranging from 52% to 62% depending on the number of functions considered, but also that each of the independent annotations is in accordance with theoretical redictions regarding the possible distributions of indefinite functions. The resulting annotated corpus is available upon request and can be accessed through a searchable online database.
Document type Conference contribution
Language English
Published at http://www.lrec-conf.org/proceedings/lrec2012/pdf/362_Paper.pdf
Other links http://www.lrec-conf.org/proceedings/lrec2012/index.html
Downloads
362_Paper (Final published version)
Permalink to this page
Back