Abstract

An important assumption underlying meaningful comparisons of scores in rater-mediated assessments is that measurement is commensurate across raters. When raters differentially apply the standards established by an instrument, scores from different raters are on fundamentally different scales and no longer preserve a common meaning and basis for comparison. In this study, we developed a method to accommodate measurement noninvariance across raters when measurements are cross-classified within two distinct hierarchical units. We formulated cross-classified graded response models with random item effects, using random discrimination and threshold effects to test for, calibrate, and account for measurement noninvariance among raters. By leveraging empirical estimates of rater-specific deviations in the discrimination and threshold parameters, the proposed method makes it possible to identify noninvariant items and to estimate and directly adjust for this noninvariance within a cross-classified framework. In a case study of teaching evaluations, the results suggested substantial noninvariance across raters and showed that establishing an approximately invariant scale through random item effects improves model fit and predictive validity.
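As a rough illustration of the kind of model the abstract describes (the paper's actual Equation 4 is not reproduced on this page), a graded response model with rater-specific random item effects can be sketched as below; the notation is assumed here for exposition rather than taken from the source.

```latex
% Sketch of a graded response model with random item effects by rater.
% All symbols are assumed notation, not the paper's Equation 4.
\begin{align}
P(Y_{ijr} \geq k \mid \theta_{jr})
  &= \operatorname{logit}^{-1}\!\big(\alpha_{ir}\,\theta_{jr} - \beta_{ikr}\big), \\
\alpha_{ir} &= \alpha_i + u_{ir}, \qquad u_{ir} \sim N\!\big(0, \sigma^2_{u_i}\big), \\
\beta_{ikr} &= \beta_{ik} + v_{ir}, \qquad v_{ir} \sim N\!\big(0, \sigma^2_{v_i}\big),
\end{align}
```

Here $i$ indexes items, $k$ indexes the ordered response categories, $r$ indexes raters, and the latent variable $\theta_{jr}$ carries the cross-classified structure (e.g., ratees crossed with raters). Measurement invariance across raters corresponds to zero variance for the rater deviations $u_{ir}$ and $v_{ir}$ on every item.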

Highlights

  • Raters have played a critical role in evaluating a wide range of psychological, cognitive, and physical traits

  • If we further introduce random item effects into the cross-classified model (Equation 4), we relax the assumption of equal item parameters across raters and allow the discrimination and threshold parameters to vary (see the sketch after this list)

  • Table 1 presents the posterior item parameter estimates from single-level, multilevel, cross-classified, and random item effects cross-classified graded response models (Equation 4)
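The following minimal Python sketch (all parameter values are hypothetical; this is not code from the paper) shows what allowing item parameters to vary by rater means in practice: a rater-specific deviation to the discrimination and a common shift to the thresholds change the category probabilities of a graded response model even when the latent trait value is identical.

```python
import numpy as np

def grm_category_probs(theta, a, b):
    """Category probabilities P(Y = k) under a graded response model.

    theta: latent trait value; a: discrimination; b: ascending
    thresholds (length K-1 for K ordered categories).
    """
    # Cumulative curves P(Y >= k) for each threshold
    cum = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    upper = np.concatenate(([1.0], cum))  # P(Y >= k), lowest category included
    lower = np.concatenate((cum, [0.0]))  # P(Y >= k + 1)
    return upper - lower                  # P(Y = k), k = 1..K

# Invariant item parameters (hypothetical values)
a_item = 1.2
b_item = np.array([-1.0, 0.0, 1.0])  # four response categories

# Rater-specific random deviations: u perturbs the discrimination,
# v shifts all of this rater's thresholds (severity/leniency)
rng = np.random.default_rng(0)
u = rng.normal(0.0, 0.3)
v = rng.normal(0.0, 0.5)

theta = 0.5
print("invariant:", grm_category_probs(theta, a_item, b_item).round(3))
print("rater    :", grm_category_probs(theta, a_item + u, b_item + v).round(3))
```

In the full model these deviations would be estimated jointly for every rater and item rather than drawn at fixed values, which is what allows noninvariant items to be identified and adjusted for.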



Introduction

Raters have played a critical role in evaluating a wide range of psychological, cognitive, and physical traits. The impetus for the use of rater-mediated assessments stems largely from the position that they often allow for more authentic and relevant assessments, thereby strengthening support for the validity of an assessment. Despite the flexibility and authenticity offered by rater-mediated assessments, they are often accompanied by features that, without proper treatment, can undermine their validity and reliability. Perhaps the most commonly cited rater effect is the difference among raters in the severity with which they apply their evaluations. Other common rater effects include the halo effect and central/extreme tendency effects. Central/extreme tendencies manifest when raters avoid, or use only, the extreme categories of a scale (Baumgartner and Steenkamp, 2001).
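A simple simulation (not from the paper; all cutpoint values are hypothetical) makes these rater effects concrete: severity can be represented as a uniform upward shift of a rater's category cutpoints, while central tendency corresponds to spreading the cutpoints so that the extreme categories are rarely reached.

```python
import numpy as np

rng = np.random.default_rng(1)
true_quality = rng.normal(0.0, 1.0, size=1000)  # latent trait of ratees

def rate(latent, cutpoints):
    """Map latent values to ordinal ratings 1..K via ascending cutpoints."""
    return np.digitize(latent, cutpoints) + 1

neutral = np.array([-1.5, -0.5, 0.5, 1.5])  # balanced rater
severe = neutral + 0.8                      # severity: every cutpoint shifted up
central = neutral * 2.0                     # central tendency: extremes rarely used

for name, cuts in [("neutral", neutral), ("severe", severe), ("central", central)]:
    ratings = rate(true_quality, cuts)
    counts = np.bincount(ratings, minlength=6)[1:]  # counts for categories 1..5
    print(name, counts)
```

Even though all three raters observe the same latent qualities, the severe rater assigns systematically lower categories and the central-tendency rater concentrates responses in the middle of the scale, which is precisely the kind of rater-specific scale distortion the proposed model is designed to absorb.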

