Taming our wild data: On intercoder reliability in discourse research

R. van Enschot; W. Spooren; A. van den Bosch; C. Burgers; L. Degand; J. Evers-Vermeul; F. Kunneman; C. Liebrecht; Y. Linders; A. Maes

doi:https://doi.org/10.51751/dujal16248

Authors	R. van Enschot; W. Spooren; A. van den Bosch; C. Burgers; L. Degand; J. Evers-Vermeul; F. Kunneman; C. Liebrecht; Y. Linders; A. Maes
Publication date	2024
Journal	Dutch Journal of Applied Linguistics
Volume | Issue number	13
Number of pages	24
Organisations	Faculty of Social and Behavioural Sciences (FMG) - Amsterdam School of Communication Research (ASCoR)
Abstract	Many research questions in the field of applied linguistics are answered by manually analyzing data collections or corpora: collections of spoken, written and/or visual communicative messages. In this kind of quantitative content analysis, the coding of subjective language data often leads to disagreement among raters. In this paper, we discuss causes of and solutions to disagreement problems in the analysis of discourse. We discuss crucial factors determining the quality and outcome of corpus analyses, and focus on the sometimes tense relation between reliability and validity. We evaluate formal assessments of intercoder reliability. We suggest a number of ways to improve the intercoder reliability, such as the precise specification of the variables and their coding categories and carving up the coding process into smaller substeps. The paper ends with a reflection on challenges for future work in discourse analysis, with special attention to big data and multimodal discourse.
Document type	Article
Language	English
Published at	https://doi.org/10.51751/dujal16248 (Final published version)