Big Data’s Disparate Impact

Solon Barocas, Andrew D Selbst

doi:10.15779/z38bg31

Abstract

Advocates of algorithmic techniques like data mining argue that these techniques eliminate human biases from the decision-making process. But an algorithm is only as good as the data it works with. Data is frequently imperfect in ways that allow these algorithms to inherit the prejudices of prior decision makers. In other cases, data may simply reflect the widespread biases that persist in society at large. In still others, data mining can discover surprisingly useful regularities that are really just preexisting patterns of exclusion and inequality. Unthinking reliance on data mining can deny historically disadvantaged and vulnerable groups full participation in society. Worse still, because the resulting discrimination is almost always an unintentional emergent property of the algorithm’s use rather than a conscious choice by its programmers, it can be unusually hard to identify the source of the problem or to explain it to a court.

This Essay examines these concerns through the lens of American antidiscrimination law, and more particularly through Title VII’s prohibition of discrimination in employment. In the absence of a demonstrable intent to discriminate, the best doctrinal hope for data mining’s victims would seem to lie in disparate impact doctrine. Case law and the Equal Employment Opportunity Commission’s Uniform Guidelines, though, hold that a practice can be justified as a business necessity when its outcomes are predictive of future employment outcomes, and data mining is specifically designed to find such statistical correlations. Unless there is a reasonably practical way to demonstrate that these discoveries are spurious, Title VII would appear to bless its use, even though the correlations it discovers will often reflect historic patterns of prejudice, others’ discrimination against members of protected groups, or flaws in the underlying data.

Addressing the sources of this unintentional discrimination and remedying the corresponding deficiencies in the law will be difficult technically, difficult legally, and difficult politically. There are a number of practical limits to what can be accomplished computationally. For example, when discrimination occurs because the data being mined is itself a result of past intentional discrimination, there is frequently no obvious method to adjust historical data to rid it of this taint. Corrective measures that alter the results of the data mining after it is complete would tread on legally and politically disputed terrain. These challenges for reform throw into stark relief the tension between the two major theories underlying antidiscrimination law: anticlassification and antisubordination. Finding a solution to big data’s disparate impact will require more than best efforts to stamp out prejudice and bias; it will require a wholesale reexamination of the meanings of “discrimination” and “fairness.”

Journal: California Law Review
Publication Date: Jan 1, 2016
Citations: 492
