Detecting Careless Cases in Practice Tests

Steven Nydick

doi:10.59863/lavm1367

Abstract

In this paper, we present a novel method for detecting careless responses in a low-stakes practice exam using machine learning models. Rather than classifying test-taker responses as careless based on model fit statistics or knowledge of truth, we built a model to predict significant changes in test scores between a practice test and an official test based on attributes of practice test items. We extracted features from practice test items using hypotheses about how careless test takers respond to items and cross-validated model performance to optimize out-of-sample predictions and reduce heteroscedasticity when predicting the closest official test. All analyses use data from the practice and official versions of the Duolingo English Test. We discuss the implications of using a machine learning model for predicting careless cases as compared with alternative, popular methods.
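The general idea described above (labeling a practice session by whether its score differs substantially from the closest official score, then predicting that label from practice-item features with cross-validation) might be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' model: the 20-point cutoff, the two features (share of very fast responses and share of empty responses), the 5-second limit, and the single-threshold classifier are all hypothetical stand-ins chosen to keep the example short.

```python
import random
from statistics import mean


def make_label(practice_score, official_score, threshold=20):
    """Return 1 if the two scores differ by at least `threshold` points.

    The absolute difference and the cutoff of 20 are assumptions for illustration.
    """
    return int(abs(official_score - practice_score) >= threshold)


def extract_features(item_responses):
    """Hypothetical carelessness features from per-item records.

    Each record is a dict with 'seconds' (response time) and 'chars' (response length).
    """
    n = len(item_responses)
    frac_fast = sum(r["seconds"] < 5 for r in item_responses) / n
    frac_empty = sum(r["chars"] == 0 for r in item_responses) / n
    return [frac_fast, frac_empty]


def k_fold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and deal them into k folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_threshold(scores, labels):
    """Pick the cutoff on a single score that maximizes training accuracy."""
    best_cut, best_acc = 0.0, -1.0
    for cut in sorted(set(scores)):
        acc = mean(int((s >= cut) == bool(y)) for s, y in zip(scores, labels))
        if acc > best_acc:
            best_cut, best_acc = cut, acc
    return best_cut


def cross_validate(features, labels, k=5):
    """Estimate out-of-sample accuracy of the threshold rule with k-fold CV."""
    scores = [sum(f) for f in features]  # crude one-number summary of the features
    accuracies = []
    for held_out in k_fold_indices(len(labels), k):
        held = set(held_out)
        train = [i for i in range(len(labels)) if i not in held]
        cut = fit_threshold([scores[i] for i in train], [labels[i] for i in train])
        accuracies.append(
            mean(int((scores[i] >= cut) == bool(labels[i])) for i in held_out)
        )
    return mean(accuracies)
```

In practice a richer model and feature set would replace the threshold rule; the point of the sketch is only that the label comes from the score change between tests rather than from model-fit statistics, and that performance is judged on held-out sessions.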

More From: Chinese/English Journal of Educational Measurement and Evaluation
