Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity

Jeffrey L Jensen,Cole Tanigawa-Lau,Mai Oudah,Nizar Habash,Dhia Fairus Shofia Fani,Daniel Karell

doi:10.1177/00811750211053370

Abstract

Computational methods have become widespread in the social sciences, but probabilistic language models remain relatively underused. We introduce language models to a general social science readership. First, we offer an accessible explanation of language models, detailing how they estimate the probability of a piece of language, such as a word or sentence, on the basis of the linguistic context. Second, we apply language models in an illustrative analysis to demonstrate the mechanics of using these models in social science research. The example application uses language models to classify names in a large administrative database; the classifications are then used to measure a sociologically important phenomenon: the spatial variation of religiosity. This application highlights several advantages of language models, including their effectiveness in classifying text that contains variation around the base structures, as is often the case with localized naming conventions and dialects. We conclude by discussing language models’ potential to contribute to sociological research beyond classification through their ability to generate language.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity

Abstract

Talk to us

Similar Papers

More From: Sociological Methodology

Lead the way for us

Journal: Sociological Methodology	Publication Date: Oct 25, 2021
Citations: 5

Similar Papers

Social sciences research in neglected tropical diseases 3: Investment in social science research in neglected diseases of poverty: a case study of Bill and Melinda Gates Foundation
Subhash Pokhrel ... Daniel Reidpath
Health Research Policy and Systems | VOL. 9
Subhash Pokhrel, et. al.Subhash Pokhrel ... Daniel Reidpath
06 Jan 2011
Health Research Policy and Systems | VOL. 9

Don't Sell Social Science Short
Dennis S Ojima ... N T Hobbs
Science | VOL. 312
Dennis S Ojima, et. al.Dennis S Ojima ... N T Hobbs
09 Jun 2006
Science | VOL. 312

Opening up research in social sciences
Lucy Annette
Impact | VOL. 2020
Lucy AnnetteLucy Annette
30 Dec 2020
Impact | VOL. 2020

Opening up research in social sciences
Lucy Annette
Impact | VOL. 2021
Lucy AnnetteLucy Annette
26 Feb 2021
Impact | VOL. 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language Models in Sociological Research: An Application to Classifying Large Administrative Data and Measuring Religiosity

Abstract

Talk to us

Similar Papers

More From: Sociological Methodology