Bias of two-level scalability coefficients and their standard errors

Open Access
Authors
Publication date 05-2020
Journal Applied Psychological Measurement
Volume | Issue number 44 | 3
Pages (from-to) 197-214
Number of pages 18
Organisations
  • Faculty of Social and Behavioural Sciences (FMG) - Research Institute of Child Development and Education (RICDE)
Abstract
Two-level Mokken scale analysis is a generalization of Mokken scale analysis for multi-rater data. The bias of estimated scalability coefficients for two-level Mokken scale analysis, the bias of their estimated standard errors, and the coverage of the confidence intervals has been investigated, under various testing conditions. It was found that the estimated scalability coefficients were unbiased in all tested conditions. For estimating standard errors, the delta method and the cluster bootstrap were compared. The cluster bootstrap structurally underestimated the standard errors of the scalability coefficients, with low coverage values. Except for unequal numbers of raters across subjects and small sets of items, the delta method standard error estimates had negligible bias and good coverage. Post hoc simulations showed that the cluster bootstrap does
not correctly reproduce the sampling distribution of the scalability coefficients, and an adapted procedure was suggested. In addition, the delta method standard errors can be slightly improved if the harmonic mean is used for unequal numbers of raters per subject rather than the arithmetic mean.
Document type Article
Language English
Published at https://doi.org/10.1177/0146621619843821
Downloads
0146621619843821 (Final published version)
Permalink to this page
Back