Managing What We Can Measure: Quantifying the Susceptibility of Automated Scoring Systems to Gaming Behavior

Derrick Higgins,Michael Heilman

doi:10.1111/emip.12036

Managing What We Can Measure: Quantifying the Susceptibility of Automated Scoring Systems to Gaming Behavior

Derrick Higgins, Michael Heilman

https://doi.org/10.1111/emip.12036

Copy DOI

Journal: Educational Measurement: Issues and Practice	Publication Date: Sep 1, 2014
Citations: 38

Affiliation: Educational Testing Service

#Automated Student Assessment Prize #Response Behavior + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

As methods for automated scoring of constructed‐response items become more widely adopted in state assessments, and are used in more consequential operational configurations, it is critical that their susceptibility to gaming behavior be investigated and managed. This article provides a review of research relevant to how construct‐irrelevant response behavior may affect automated constructed‐response scoring, and aims to address a gap in that literature: the need to assess the degree of risk before operational launch. A general framework is proposed for evaluating susceptibility to gaming, and an initial empirical demonstration is presented using the open‐source short‐answer scoring engines from the Automated Student Assessment Prize (ASAP) Challenge.

Full Text