Provable Detection of Propagating Sampling Bias in Prediction Models

Pavan Ravishankar, Qingyu Mo, Edward McFowland III, Daniel B. Neill

doi:10.1609/aaai.v37i8.26144

Abstract

With an increased focus on incorporating fairness in machine learning models, it becomes imperative not only to assess and mitigate bias at each stage of the machine learning pipeline but also to understand the downstream impacts of bias across stages. Here we consider a general, but realistic, scenario in which a predictive model is learned from (potentially biased) training data, and model predictions are assessed post-hoc for fairness by some auditing method. We provide a theoretical analysis of how a specific form of data bias, differential sampling bias, propagates from the data stage to the prediction stage. Unlike prior work, we evaluate the downstream impacts of data biases quantitatively rather than qualitatively and prove theoretical guarantees for detection. Under reasonable assumptions, we quantify how the amount of bias in the model predictions varies as a function of the amount of differential sampling bias in the data, and at what point this bias becomes provably detectable by the auditor. Through experiments on two criminal justice datasets, the well-known COMPAS dataset and historical data from the NYPD's stop-and-frisk policy, we demonstrate that the theoretical results hold in practice even when our assumptions are relaxed.
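
The dependence of prediction bias on sampling bias described in the abstract can be illustrated with a toy simulation. The sketch below is not the paper's method or its auditor. It assumes, purely for illustration, that differential sampling bias means positive and negative records of one subgroup are retained in the training data at different rates (`keep_pos`, `keep_neg`), and that the model simply predicts the subgroup's empirical positive rate. The function names and parameters are all hypothetical.

```python
import numpy as np


def expected_rate_after_sampling(p, keep_pos, keep_neg):
    """Closed-form positive rate in a sample where positives are retained
    with probability keep_pos and negatives with probability keep_neg.
    (Illustrative assumption, not a formula quoted from the paper.)"""
    return p * keep_pos / (p * keep_pos + (1 - p) * keep_neg)


def simulate_group_rate(p, keep_pos, keep_neg, n=200_000, seed=0):
    """Draw n labels with true positive rate p, drop records according to
    the label-dependent retention rates, and return the positive rate a
    frequency-based model would learn from the retained records."""
    rng = np.random.default_rng(seed)
    y = rng.random(n) < p                      # true outcomes
    keep_prob = np.where(y, keep_pos, keep_neg)
    kept = rng.random(n) < keep_prob           # label-dependent sampling
    return y[kept].mean()


if __name__ == "__main__":
    true_rate = 0.5
    # Sweep the retention rate of negatives; 1.0 means no differential bias.
    for keep_neg in (1.0, 0.8, 0.6, 0.4):
        learned = simulate_group_rate(true_rate, 1.0, keep_neg)
        theory = expected_rate_after_sampling(true_rate, 1.0, keep_neg)
        print(f"keep_neg={keep_neg:.1f}  learned={learned:.3f}  "
              f"theory={theory:.3f}  shift={learned - true_rate:+.3f}")
```

In this toy setting the gap between the learned rate and the true rate grows as the retention rates diverge. That is the kind of quantitative relationship the abstract refers to, although the paper's own assumptions and guarantees are more general than this sketch.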

Published in: Proceedings of the AAAI Conference on Artificial Intelligence
