Fairness-aware Data Integration

Lacramioara Mazilu, Norman W Paton, Alvaro A A Fernandes, Nikolaos Konstantinou

doi:10.1145/3519419

Open Access

https://doi.org/10.1145/3519419

Abstract

Machine learning can be applied in applications that take decisions that impact people’s lives. Such techniques have the potential to make decision making more objective, but there is also a risk that the decisions discriminate against certain groups as a result of bias in the underlying data. Reducing bias, or promoting fairness, has been a focus of significant investigation in machine learning, for example, by pre-processing the training data, changing the learning algorithm, or post-processing the results of the learning. However, prior to these activities, data integration discovers and integrates the data that is used for training, and data integration processes have the potential to produce data that leads to biased conclusions. In this article, we propose an approach that generates schema mappings in ways that take into account: (i) properties that are intrinsic to mapping results that may give rise to bias in analyses; and (ii) bias observed in classifiers trained on the results of different sets of mappings. The approach explores a space of different ways of integrating the data, using a Tabu search algorithm, guided by bias-aware objective functions that represent different types of bias. The resulting approach is evaluated using the Adult Census and German Credit datasets to explore the extent to which, and the circumstances in which, it can increase the fairness of the results of the data integration process.
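
The search described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the neighbourhood, the tabu tenure, or the exact objective functions, so everything below is an assumption made for illustration. The candidate mappings `m1` to `m4`, their per-group counts, the group names `a` and `b`, and the `parity_gap` objective (the absolute difference in positive-label rates between two groups, a bias measure of type (i), intrinsic to the mapping results) are all hypothetical.

```python
from collections import deque
from typing import Dict, FrozenSet, Tuple

# Hypothetical summaries of each candidate mapping's result:
# (protected group, label) -> number of rows. Not taken from the paper.
Counts = Dict[Tuple[str, int], int]
CANDIDATES: Dict[str, Counts] = {
    "m1": {("a", 1): 30, ("a", 0): 10, ("b", 1): 5, ("b", 0): 15},
    "m2": {("a", 1): 10, ("a", 0): 20, ("b", 1): 20, ("b", 0): 10},
    "m3": {("a", 1): 5, ("a", 0): 5, ("b", 1): 5, ("b", 0): 5},
    "m4": {("a", 1): 20, ("a", 0): 5, ("b", 1): 2, ("b", 0): 20},
}


def parity_gap(selection: FrozenSet[str]) -> float:
    """|P(y=1 | a) - P(y=1 | b)| over the union of the selected results."""
    totals = {(g, y): 0 for g in ("a", "b") for y in (0, 1)}
    for name in selection:
        for key, n in CANDIDATES[name].items():
            totals[key] += n
    rates = []
    for g in ("a", "b"):
        size = totals[(g, 1)] + totals[(g, 0)]
        if size == 0:
            return 1.0  # treat a missing group as maximally biased
        rates.append(totals[(g, 1)] / size)
    return abs(rates[0] - rates[1])


def tabu_search(iterations: int = 20, tenure: int = 2):
    """Minimise parity_gap over non-empty subsets of candidate mappings."""
    current = frozenset(CANDIDATES)          # start from all mappings
    best, best_score = current, parity_gap(current)
    tabu = deque(maxlen=tenure)              # recently flipped mappings
    for _ in range(iterations):
        moves = []
        for name in sorted(CANDIDATES):
            neighbour = current ^ {name}     # add or drop one mapping
            if not neighbour:
                continue                     # keep at least one mapping
            score = parity_gap(neighbour)
            # Aspiration: a tabu move is allowed if it beats the best so far.
            if name not in tabu or score < best_score:
                moves.append((score, name, neighbour))
        if not moves:
            break
        # Take the best admissible move, even if it worsens the score.
        score, name, current = min(moves, key=lambda m: (m[0], m[1]))
        tabu.append(name)
        if score < best_score:
            best, best_score = current, score
    return best, best_score


if __name__ == "__main__":
    selected, gap = tabu_search()
    print(sorted(selected), round(gap, 3))
```

Accepting the best admissible neighbour even when it is worse than the current selection, while forbidding recently reversed moves, is what lets Tabu search leave local optima. A bias measure of type (ii) would replace `parity_gap` with a function that trains a classifier on the integrated data and measures the bias of its predictions.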

Journal: Journal of Data and Information Quality
Publication Date: Nov 23, 2022
Citations: 1
