Abstract

Research at the intersection of machine learning and the social sciences has provided critical new insights into social behavior. At the same time, a variety of issues have been identified with the machine learning models used to analyze social data. These issues range from technical problems with the data used and features constructed, to problematic modeling assumptions, to limited interpretability, to the models' contributions to bias and inequality. Computational researchers have sought out technical solutions to these problems. The primary contribution of the present work is to argue that there is a limit to these technical solutions. At this limit, we must instead turn to social theory. We show how social theory can be used to answer basic methodological and interpretive questions that technical solutions cannot when building machine learning models, and when assessing, comparing, and using those models. In both cases, we draw on related existing critiques, provide examples of how social theory has already been used constructively in existing work, and discuss where other existing work may have benefited from the use of specific social theories. We believe this paper can act as a guide for computer and social scientists alike to navigate the substantive questions involved in applying the tools of machine learning to social data.

Highlights

  • Machine learning is increasingly being applied to vast quantities of social data generated from and about people (Lazer et al., 2009)

  • Scholars have argued that machine learning models applied to social data often fail to account for the myriad biases that arise during the analysis pipeline and can undercut the validity of study claims (Olteanu et al., 2016)

  • Similar critiques have been made by Jacobs and Wallach (2019), who argue that measurement theory, a domain of social theory concerned with the validity and reliability of different ways of measuring social constructs, provides a concrete and useful language with which different definitions of fairness, and the impacts of algorithms, can be assessed


Summary

INTRODUCTION

Machine learning is increasingly being applied to vast quantities of social data generated from and about people (Lazer et al., 2009). Scholars have argued that machine learning models applied to social data often fail to account for the myriad biases that arise during the analysis pipeline and can undercut the validity of study claims (Olteanu et al., 2016). We argue, and show, that at each step of the machine learning pipeline, problems arise that cannot be solved by technical means alone. We explain how social theory helps us address problems that arise throughout the process of building and evaluating machine learning models for social data.

RELATED WORK

THEORY IN
  Problem Selection and Framing
  Outcome Definition
  Data Selection
  Feature Engineering
  Annotation
  Model Construction

THEORY OUT
  Generalizability
  Parsimony
  Fairness

CONCLUSION
