Abstract

Context: There are growing concerns about algorithmic fairness, as some machine learning (ML)-based algorithms have been found to exhibit biases against protected attributes such as gender, race, and age. Individual fairness requires an ML classifier to produce similar outputs for similar individuals. Verification Based Testing (Vbt) is a state-of-the-art black-box testing algorithm for individual fairness that leverages constraint solving to generate test cases.

Objective: Generating diverse test cases is expected to facilitate efficient detection of diverse discriminatory data instances (i.e., cases that violate individual fairness). Hashing-based sampling techniques draw a sample approximately uniformly at random from the set of solutions of given Boolean constraints. We propose Vbt-X, which augments Vbt with hashing-based sampling to improve its testing performance.

Method: We realize hashing-based sampling for Vbt (see the illustrative sketch below). The challenge is that off-the-shelf hashing-based sampling techniques cannot be integrated in a straightforward manner, because the constraints in Vbt are generally not Boolean. Moreover, we propose several enhancement techniques to make Vbt-X more efficient.

Results: To evaluate our method, we conduct experiments in which Vbt-X is compared to Vbt, Sg, and ExpGA (other well-known fairness testing algorithms) over a set of configurations spanning several datasets, protected attributes, and ML classifiers. The results show that, in each configuration, Vbt-X detects more discriminatory data instances, and with higher diversity, than Vbt and Sg. Vbt-X also detects discriminatory data instances with higher diversity than ExpGA, although it detects fewer of them than ExpGA does.

Conclusion: Our proposed method outperforms other state-of-the-art black-box fairness testing algorithms, particularly in terms of diversity. It can efficiently identify flaws in ML classifiers with respect to individual fairness, informing subsequent improvement of a classifier. Although our method is specific to individual fairness, with some technical adaptations it could also be applied to testing other aspects of a software system, such as security and counterfactual explanations; this remains future work.
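To make the hashing-based sampling idea mentioned in the Method concrete, the following is a minimal, self-contained sketch of the general principle (as used by tools such as UniGen), not the authors' Vbt-X implementation. It adds random XOR (parity) constraints to a toy Boolean formula, each of which cuts the solution space roughly in half in expectation, and then picks uniformly from the surviving "cell". All names, the example formula, and the number of XOR constraints are illustrative assumptions; a real implementation would use a SAT solver rather than brute-force enumeration and would choose the number of constraints from a model-count estimate.

import itertools
import random

N_VARS = 6

def formula(a):
    # Illustrative constraint set: (x0 or x1) and (not x2 or x3) and (x4 != x5)
    return (a[0] or a[1]) and ((not a[2]) or a[3]) and (a[4] != a[5])

def random_xor_constraint(n):
    # A random subset of variables and a random parity bit define one hash constraint.
    subset = [i for i in range(n) if random.random() < 0.5]
    parity = random.randrange(2)
    return subset, parity

def satisfies_xors(a, xors):
    # An assignment survives if, for every constraint, the chosen
    # variables sum to the required parity modulo 2.
    return all(sum(a[i] for i in subset) % 2 == parity
               for subset, parity in xors)

def hashing_based_sample(n_xors=3):
    # Each random XOR constraint halves the solution space in expectation,
    # so the survivors form a small cell; picking uniformly from that cell
    # approximates a uniform draw from all solutions of the formula.
    while True:
        xors = [random_xor_constraint(N_VARS) for _ in range(n_xors)]
        cell = [a for a in itertools.product([0, 1], repeat=N_VARS)
                if formula(a) and satisfies_xors(a, xors)]
        if cell:  # retry if the randomly chosen cell happens to be empty
            return random.choice(cell)

print(hashing_based_sample())

As the abstract notes, the constraints arising in Vbt are generally not Boolean, so this Boolean-level recipe cannot be applied directly; bridging that gap is precisely the technical challenge the paper addresses.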
