Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment.

Todd Zhou,Hong Jiao

doi:10.1177/00131644221117193

Abstract

Cheating detection in large-scale assessment received considerable attention in the extant literature. However, none of the previous studies in this line of research investigated the stacking ensemble machine learning algorithm for cheating detection. Furthermore, no study addressed the issue of class imbalance using resampling. This study explored the application of the stacking ensemble machine learning algorithm to analyze the item response, response time, and augmented data of test-takers to detect cheating behaviors. The performance of the stacking method was compared with that of two other ensemble methods (bagging and boosting) as well as six base non-ensemble machine learning algorithms. Issues related to class imbalance and input features were addressed. The study results indicated that stacking, resampling, and feature sets including augmented summary data generally performed better than its counterparts in cheating detection. Compared with other competing machine learning algorithms investigated in this study, the meta-model from stacking using discriminant analysis based on the top two base models-Gradient Boosting and Random Forest-generally performed the best when item responses and the augmented summary statistics were used as the input features with an under-sampling ratio of 10:1 among all the study conditions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment.

Abstract

Talk to us

Similar Papers

More From: Educational and psychological measurement

Lead the way for us

Journal: Educational and psychological measurement	Publication Date: Aug 13, 2022
Citations: 19

Similar Papers

Nonlinear Latent Effects in Diagnostic Classification Modeling Incorporating Response Times
Xin Qiao ... Manqian Liao
-
Xin Qiao, et. al.Xin Qiao ... Manqian Liao
01 Jan 2020
01 Jan 2020

Modelling individual response time effects between and within experimental speed conditions: A GLMM approach for speeded tests
Frank Goldhammer ... Ulf Kroehne
British Journal of Mathematical and Statistical Psychology | VOL. 70
Frank Goldhammer, et. al.Frank Goldhammer ... Ulf Kroehne
01 May 2017
British Journal of Mathematical and Statistical Psychology | VOL. 70

The effect of well-known burn-related features on machine learning algorithms in burn patients' mortality prediction.
Hilmi Yazıcı
Ulusal travma ve acil cerrahi dergisi = Turkish journal of trauma & emergency surgery : TJTES | VOL. 29
Hilmi YazıcıHilmi Yazıcı
01 Jan 2023
Ulusal travma ve acil cerrahi dergisi = Turkish journal of trauma & emergency surgery : TJTES | VOL. 29

Explanatory Cognitive Diagnostic Modeling Incorporating Response Times
Xin Qiao ... Hong Jiao
Journal of Educational Measurement | VOL. 58
Xin Qiao, et. al.Xin Qiao ... Hong Jiao
01 Dec 2021
Journal of Educational Measurement | VOL. 58

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment.

Abstract

Talk to us

Similar Papers

More From: Educational and psychological measurement