Abstract
When assessing text, supervised natural language processing (NLP) models have traditionally been used to measure targeted constructs in the organizational sciences. However, these models require significant resources to develop. Emerging “off-the-shelf” large language models (LLMs) offer a way to evaluate organizational constructs without building customized models, but it is unclear whether off-the-shelf LLMs score organizational constructs accurately and what evidence is necessary to infer validity. In this study, we compared the validity of supervised NLP models to that of off-the-shelf LLMs (ChatGPT-3.5 and ChatGPT-4). Across six organizational datasets and thousands of comments, we found that supervised NLP models produced scores that were more reliable than those of human coders. Even though the off-the-shelf LLMs were not developed for this purpose, they produced scores with psychometric properties similar to, though slightly less favorable than, those of the supervised models. We connect these findings to broader validation considerations and present a decision chart to guide researchers and practitioners on how they can use off-the-shelf LLMs to score targeted constructs, including guidance on how psychometric evidence can be “transported” to new contexts.