Abstract

Automatic grading models are valued for the time and effort they save when instructing large student bodies. Especially with the increasing digitization of education and the interest in large-scale standardized testing, automatic grading has become popular enough that commercial solutions are widely available and used. However, for short answer formats, automatic grading is challenging due to the ambiguity and versatility of natural language. While automatic short answer grading models are beginning to approach human performance on some datasets, their robustness, especially to adversarially manipulated data, is questionable. Exploitable vulnerabilities in grading models can have far-reaching consequences, ranging from cheating students receiving undeserved credit to undermining automatic grading altogether, even when most predictions are valid. In this paper, we devise a black-box adversarial attack tailored to the educational short answer grading scenario to investigate the robustness of grading models. In our attack, we insert adjectives and adverbs at natural places in incorrect student answers, fooling the model into predicting them as correct. Using the state-of-the-art models BERT and T5, we observed a loss of prediction accuracy of between 10 and 22 percentage points. While our attack made answers appear less natural to human graders in our experiments, it did not significantly raise their suspicion of cheating. Based on our experiments, we provide recommendations for using automatic grading systems more safely in practice.
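
To make the described attack concrete, the sketch below shows one way an adjective/adverb insertion attack against a black-box grader could be implemented. It is only an illustration under stated assumptions: spaCy is assumed for part-of-speech tagging, grade_answer is a hypothetical placeholder standing in for the target BERT/T5 grading model, and the word lists, insertion heuristic, and query budget are illustrative choices rather than the paper's actual method.

# Hypothetical sketch of an adjective/adverb insertion attack on a black-box
# short answer grader. grade_answer, the word lists, and the insertion heuristic
# are illustrative assumptions, not the paper's exact setup.
import itertools
import spacy

# Assumes the small English spaCy model is installed (python -m spacy download en_core_web_sm)
nlp = spacy.load("en_core_web_sm")

# Small illustrative candidate vocabularies.
ADJECTIVES = ["basic", "general", "typical"]
ADVERBS = ["basically", "generally", "typically"]


def grade_answer(answer: str) -> bool:
    """Placeholder for the black-box grader (e.g., a fine-tuned BERT or T5 classifier).

    Returns True if the model predicts the answer as correct.
    """
    raise NotImplementedError("Plug in the target grading model here.")


def candidate_insertions(answer: str):
    """Yield modified answers with an adjective before a noun or an adverb before a verb."""
    doc = nlp(answer)
    tokens = [t.text for t in doc]
    for tok in doc:
        if tok.pos_ == "NOUN":
            words = ADJECTIVES
        elif tok.pos_ == "VERB":
            words = ADVERBS
        else:
            continue  # only insert at positions that read naturally
        for word in words:
            modified = tokens[: tok.i] + [word] + tokens[tok.i :]
            yield " ".join(modified)


def attack(incorrect_answer: str, max_queries: int = 50):
    """Query the black-box grader until an insertion flips the prediction to 'correct'."""
    for modified in itertools.islice(candidate_insertions(incorrect_answer), max_queries):
        if grade_answer(modified):
            return modified  # adversarial answer found
    return None  # attack failed within the query budget

In this black-box setting the attacker only observes the grader's correct/incorrect decision for each query, which is why the sketch simply enumerates candidate insertions under a query budget rather than using gradient information.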
