Abstract
The past few years have witnessed an explosion of attention to the bias displayed by Machine Learning (ML) techniques towards different groups of people (e.g., female vs. male). Although ML techniques have been widely adopted in education, it remains largely unexplored to what extent such bias manifests itself in this specific setting and how it can be reduced or eliminated. Given the increasing importance of ML techniques in empowering educators to teach effectively, this study aimed to quantify the characteristics of the original datasets that may be correlated with the subsequent predictive unfairness displayed by ML models. To this end, we empirically investigated two types of data bias (i.e., distribution bias and hardness bias) towards students of different sexes and first-language backgrounds across five frequently performed predictive tasks in education. Then, to improve ML fairness, we drew inspiration from the well-established research on Class Balancing Techniques (CBTs), in which samples are generated or removed to alleviate the predictive disparity between prediction classes. We proposed two simple but effective strategies that empower class balancing techniques to alleviate data biases and improve prediction fairness. Through extensive analyses and evaluations, we demonstrated that ML models can greatly improve prediction fairness (by up to 66%) with only a small sacrifice (less than 1%) in prediction accuracy when the training data are balanced using students’ demographic information and the overall hardness bias measure. All data and code used in this study are publicly accessible via https://github.com/lsha49/FairEdu.
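To illustrate the general idea of balancing training data using demographic information, the sketch below randomly oversamples each (class, demographic group) cell to the size of the largest cell. This is a minimal, hypothetical implementation of demographic-aware class balancing written for illustration only; the strategies and the hardness bias measure proposed in the paper are defined in the linked repository, and the keys `label` and `sex` here are assumed field names.

```python
import random
from collections import defaultdict

def balance_by_class_and_group(samples, label_key="label", group_key="sex", seed=0):
    """Randomly oversample every (class, demographic group) cell up to the
    size of the largest cell -- a simple sketch of class balancing that
    conditions on a sensitive attribute rather than on class alone."""
    rng = random.Random(seed)
    # Partition samples into cells keyed by (prediction class, group).
    cells = defaultdict(list)
    for s in samples:
        cells[(s[label_key], s[group_key])].append(s)
    target = max(len(cell) for cell in cells.values())
    balanced = []
    for cell in cells.values():
        balanced.extend(cell)
        # Draw extra samples with replacement until the cell reaches `target`.
        balanced.extend(rng.choices(cell, k=target - len(cell)))
    return balanced

# Toy dataset: class 1 is under-represented for group "F".
data = (
    [{"label": 1, "sex": "F"}] * 2
    + [{"label": 1, "sex": "M"}] * 6
    + [{"label": 0, "sex": "F"}] * 4
    + [{"label": 0, "sex": "M"}] * 4
)
balanced = balance_by_class_and_group(data)
```

After balancing, every (class, group) cell contains the same number of samples, so a model trained on the result sees no distribution bias across classes or demographic groups; random oversampling stands in here for whatever generation/removal scheme a particular CBT uses.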