Abstract

When faced with severely imbalanced binary classification problems, we often train models on bootstrapped data in which the instances of each class occur in a more favorable ratio, often equal to one. We view algorithmic inequity through the lens of imbalanced classification: in order to balance the performance of a classifier across groups, we can bootstrap to achieve training sets that are balanced with respect to both labels and group identity. For an example problem with severe class imbalance—prediction of suicide death from administrative patient records—we illustrate how an equity-directed bootstrap can bring test set sensitivities and specificities much closer to satisfying the equal odds criterion. In the context of naïve Bayes and logistic regression, we analyze the equity-directed bootstrap, demonstrating that it works by bringing odds ratios close to one, and linking it to methods involving intercept adjustment, thresholding, and weighting.
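
The core resampling step lends itself to a short sketch. The following Python snippet is a minimal illustration of the idea described above, not the authors' implementation; the function name, DataFrame layout, and column names are our own assumptions. It resamples every (label, group) cell with replacement to a common size, so that the training set is balanced jointly over labels and group identity:

```python
import pandas as pd

def equity_directed_bootstrap(df: pd.DataFrame, label_col: str, group_col: str,
                              n_per_cell: int | None = None,
                              seed: int = 0) -> pd.DataFrame:
    """Bootstrap a training set in which every (label, group) cell
    is equally represented."""
    # Partition the rows into (label, group) cells.
    cells = [cell for _, cell in df.groupby([label_col, group_col])]
    if n_per_cell is None:
        # Upsample every cell to the size of the largest one.
        n_per_cell = max(len(cell) for cell in cells)
    # Sample each cell with replacement to the common size.
    parts = [cell.sample(n=n_per_cell, replace=True, random_state=seed + i)
             for i, cell in enumerate(cells)]
    # Shuffle so the resampled rows are not blocked by cell.
    return pd.concat(parts, ignore_index=True).sample(frac=1, random_state=seed)

# Hypothetical usage with assumed column names:
# balanced = equity_directed_bootstrap(records, label_col="suicide_death",
#                                      group_col="race")
```

Setting each cell to a common size drives the within-training-set odds ratios toward one, which is the mechanism the abstract credits for moving test set sensitivities and specificities closer to equal odds.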
