Synthetic Replacements for Human Survey Data? The Perils of Large Language Models

James Bisbee,Joshua D Clinton,Jennifer M Larson,Brenton Kenkel,Cassy Dorff

doi:10.1017/pan.2024.5

Abstract

Abstract Large language models (LLMs) offer new research possibilities for social scientists, but their potential as “synthetic data” is still largely unknown. In this paper, we investigate how accurately the popular LLM ChatGPT can recover public opinion, prompting the LLM to adopt different “personas” and then provide feeling thermometer scores for 11 sociopolitical groups. The average scores generated by ChatGPT correspond closely to the averages in our baseline survey, the 2016–2020 American National Election Study (ANES). Nevertheless, sampling by ChatGPT is not reliable for statistical inference: there is less variation in responses than in the real surveys, and regression coefficients often differ significantly from equivalent estimates obtained using ANES data. We also document how the distribution of synthetic responses varies with minor changes in prompt wording, and we show how the same prompt yields significantly different results over a 3-month period. Altogether, our findings raise serious concerns about the quality, reliability, and reproducibility of synthetic survey data generated by LLMs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Political Analysis	Publication Date: May 17, 2024
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Synthetic Replacements for Human Survey Data? The Perils of Large Language Models

Abstract

Talk to us

Similar Papers

More From: Political Analysis

Lead the way for us

Similar Papers

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... Bianca Maria Colosimo
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... Bianca Maria Colosimo
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

National election studies and macro analysis
R.S Erikson
Electoral Studies | VOL. 21
R.S EriksonR.S Erikson
19 Jan 2002
Electoral Studies | VOL. 21

Mathematical Problem Solving in Arabic: Assessing Large Language Models
Abeer Mahgoub ... Elhassan Anas Elsabry
Procedia Computer Science | VOL. 244
Abeer Mahgoub, et. al.Abeer Mahgoub ... Elhassan Anas Elsabry
01 Jan 2024
Procedia Computer Science | VOL. 244

Large language models and synthetic health data: progress and prospects.
Daniel Smolyak ... Ritu Agarwal
JAMIA open | VOL. 7
Daniel Smolyak, et. al.Daniel Smolyak ... Ritu Agarwal
08 Oct 2024
JAMIA open | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Synthetic Replacements for Human Survey Data? The Perils of Large Language Models

Abstract

Talk to us

Similar Papers

More From: Political Analysis