Abstract
Scoring procedures for rater-mediated writing assessments often include checks for agreement between the raters who score students’ essays. When raters assign non-adjacent ratings to the same essay, a third rater is often employed to “resolve” the discrepant ratings. The procedures for flagging essays for score resolution are similar to person fit analyses based on item response theory (IRT). We used data from two writing performance assessments in science and social studies to explore the correspondence between traditional score resolution procedures and IRT person fit statistics. We observed that rater agreement criteria and person fit criteria flag many, but not all, of the same rating profiles for additional investigation. We also observed significantly different values of person fit statistics between students whose essays were and were not flagged for third ratings by the rater agreement criteria. Finally, when we used resolved ratings in place of the original ratings, we observed improvements in person fit for most, but not all, of the students whose essays were flagged for third ratings. These results suggest that person fit analyses may provide a complementary approach to rater agreement criteria. We discuss these results in terms of their implications for research and practice.
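The following is a minimal sketch, not the authors' code, of the two mechanisms the abstract contrasts: the rater agreement rule that flags non-adjacent ratings for a third (resolution) rating, and a generic IRT-style person fit index (an outfit-type mean square of standardized residuals). All data values, the 1-4 rubric, and the specific fit index are illustrative assumptions rather than details taken from the study.

```python
# Minimal sketch: rater-agreement flagging rule and a generic person fit index.
# The rubric range, example ratings, and model-expected values below are
# hypothetical placeholders, not data from the study.

def flag_nonadjacent(rating_1: int, rating_2: int, max_gap: int = 1) -> bool:
    """Flag an essay for third-rater resolution when the two original
    ratings differ by more than one scale point (i.e., are non-adjacent)."""
    return abs(rating_1 - rating_2) > max_gap


def outfit_mean_square(observed, expected, variance) -> float:
    """Mean of squared standardized residuals across a student's ratings;
    values far from 1.0 suggest person misfit (a generic IRT fit index,
    not necessarily the statistic used in the study)."""
    z_squared = [((x - e) ** 2) / v for x, e, v in zip(observed, expected, variance)]
    return sum(z_squared) / len(z_squared)


# Hypothetical example: two raters score one essay on a 1-4 rubric.
print(flag_nonadjacent(2, 4))   # True  -> non-adjacent, route to a third rater
print(flag_nonadjacent(3, 4))   # False -> adjacent, no resolution needed

# Hypothetical person fit check for one student across several rated tasks.
obs = [2, 4, 3]
exp = [2.8, 2.9, 3.1]           # model-expected ratings (placeholder values)
var = [0.60, 0.70, 0.65]        # model variances (placeholder values)
print(round(outfit_mean_square(obs, exp, var), 2))
```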