Abstract

Machine-learning algorithms have become deeply embedded in contemporary society. As such, ample attention has been paid to the contents, biases, and underlying assumptions of the datasets on which many algorithmic models are trained. Yet what happens when algorithms are trained on data that are not real, but instead ‘synthetic’, referring to no real persons, objects, or events? Increasingly, synthetic data are being incorporated into the training of machine-learning algorithms for use in various societal domains. There is currently little understanding, however, of the role played by, and the ethicopolitical implications of, synthetic training data for machine-learning algorithms. In this article, I explore the politics of synthetic data through two central aspects: first, synthetic data promise to emerge as a rich source of exposure to variability for the algorithm; second, synthetic data promise to place algorithms beyond the realm of risk. I propose that an analysis of these two areas will help us better understand how machine-learning algorithms are envisioned in the light of synthetic data, but also how synthetic training data actively reconfigure the conditions of possibility for machine learning in contemporary society.
