Bias of two-level scalability coefficients and their standard errors

L. Koopman; B.J.H. Zijlstra; M. de Rooij; L.A. van der Ark

doi:https://doi.org/10.1177/0146621619843821

Authors	L. Koopman; B.J.H. Zijlstra; M. de Rooij; L.A. van der Ark
Publication date	05-2020
Journal	Applied Psychological Measurement
Volume | Issue number	44 | 3
Pages (from-to)	197-214
Number of pages	18
Organisations	Faculty of Social and Behavioural Sciences (FMG) - Research Institute of Child Development and Education (RICDE)
Abstract	Two-level Mokken scale analysis is a generalization of Mokken scale analysis for multi-rater data. The bias of estimated scalability coefficients for two-level Mokken scale analysis, the bias of their estimated standard errors, and the coverage of the confidence intervals were investigated under various testing conditions. The estimated scalability coefficients were unbiased in all tested conditions. For estimating standard errors, the delta method and the cluster bootstrap were compared. The cluster bootstrap systematically underestimated the standard errors of the scalability coefficients, resulting in low coverage values. Except for conditions with unequal numbers of raters across subjects or small sets of items, the delta method standard error estimates had negligible bias and good coverage. Post hoc simulations showed that the cluster bootstrap does not correctly reproduce the sampling distribution of the scalability coefficients, and an adapted procedure was suggested. In addition, the delta method standard errors can be slightly improved if, for unequal numbers of raters per subject, the harmonic mean is used rather than the arithmetic mean.
Document type	Article
Language	English
Published at	https://doi.org/10.1177/0146621619843821 (Final published version)
Downloads	0146621619843821 (Final published version)
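
The two computational ideas named in the abstract, resampling whole subjects in a cluster bootstrap and replacing the arithmetic mean of the number of raters with the harmonic mean, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the toy data, and the placeholder statistic (a grand mean instead of a two-level scalability coefficient) are assumptions made only for illustration.

```python
import random
from statistics import fmean, harmonic_mean, stdev


def cluster_bootstrap_se(clusters, statistic, n_boot=200, seed=0):
    """Generic cluster bootstrap standard error (illustrative helper).

    clusters:  list of lists, one list of rater scores per subject.
    statistic: function mapping a list of clusters to a single number.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole subjects with replacement, so that the raters
        # belonging to one subject always stay together.
        resampled = rng.choices(clusters, k=len(clusters))
        estimates.append(statistic(resampled))
    # The spread of the bootstrap estimates serves as the standard error.
    return stdev(estimates)


def grand_mean(clusters):
    # Placeholder statistic; a real analysis would compute a two-level
    # scalability coefficient here instead.
    return fmean(score for cluster in clusters for score in cluster)


# Hypothetical toy data: four subjects rated by 2, 4, 4, and 10 raters.
scores = [
    [1, 2],
    [3, 3, 4, 2],
    [0, 1, 1, 2],
    [4, 4, 3, 4, 4, 3, 4, 4, 3, 4],
]
raters_per_subject = [len(cluster) for cluster in scores]

arithmetic = fmean(raters_per_subject)        # (2 + 4 + 4 + 10) / 4
harmonic = harmonic_mean(raters_per_subject)  # 4 / (1/2 + 1/4 + 1/4 + 1/10)
bootstrap_se = cluster_bootstrap_se(scores, grand_mean)
```

With unequal numbers of raters, the harmonic mean is never larger than the arithmetic mean, so it gives less weight to the few subjects with many raters. Note that the abstract reports that the standard cluster bootstrap underestimated the standard errors of the scalability coefficients, so the resampling scheme above should be read as the baseline procedure rather than the adapted one the authors suggest.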
