Abstract

With increased emphasis on teacher quality in the Race to the Top federal grants program, rater agreement is an important topic in teacher evaluation. Variations of kappa have often been used to assess inter-rater reliability (IRR), but research has shown that kappa suffers from a paradox in which high exact agreement can produce low kappa values. Two chance-corrected measures of IRR were examined to determine whether Gwet’s AC1 statistic provides a more stable estimate than kappa. Findings suggest that Gwet’s AC1 statistic outperforms kappa as a chance-corrected measure of IRR when compared against exact agreement, and that AC1 shows promise for future IRR studies in a teacher evaluation context.
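To make the kappa paradox concrete, the sketch below computes Cohen’s kappa and Gwet’s AC1 for a hypothetical two-rater, two-category table with skewed marginals (e.g., most teachers rated “effective”). It uses the standard published formulas for the two statistics, not the paper’s own data or code; the example counts are illustrative assumptions only.

```python
# Illustration of the kappa paradox: with highly skewed marginals, exact
# agreement is high but Cohen's kappa is low, while Gwet's AC1 stays close
# to the observed agreement. Standard two-rater formulas; example data is made up.

def cohens_kappa(table):
    """table[i][j] = count of items rater A placed in category i and rater B in j."""
    q = len(table)
    n = sum(sum(row) for row in table)
    po = sum(table[i][i] for i in range(q)) / n                    # observed exact agreement
    row = [sum(r) / n for r in table]                              # rater A marginal proportions
    col = [sum(table[i][j] for i in range(q)) / n for j in range(q)]  # rater B marginals
    pe = sum(row[k] * col[k] for k in range(q))                    # chance agreement (product of marginals)
    return (po - pe) / (1 - pe)

def gwets_ac1(table):
    """Gwet's AC1 for two raters; chance agreement is based on average category prevalence."""
    q = len(table)
    n = sum(sum(row) for row in table)
    po = sum(table[i][i] for i in range(q)) / n
    row = [sum(r) / n for r in table]
    col = [sum(table[i][j] for i in range(q)) / n for j in range(q)]
    pi = [(row[k] + col[k]) / 2 for k in range(q)]                 # average prevalence per category
    pe = sum(p * (1 - p) for p in pi) / (q - 1)                    # AC1 chance-agreement term
    return (po - pe) / (1 - pe)

# Hypothetical skewed ratings: 90 items both raters call "effective",
# 5 disagreements in each direction, none jointly "ineffective" -> 90% exact agreement.
table = [[90, 5],
         [5, 0]]
print(round(cohens_kappa(table), 3))  # approx -0.053: kappa is near zero despite 90% agreement
print(round(gwets_ac1(table), 3))     # approx  0.890: AC1 tracks the observed agreement
```

With 90% exact agreement, kappa is slightly negative because its chance-agreement term is inflated by the skewed marginals, whereas AC1 remains close to the observed agreement, which is the stability property the study examines.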
