Humans vs. large language models: Judgmental forecasting in an era of advanced AI

Mahdi Abolghasemi,Odkhishig Ganbold,Kristian Rotaru

doi:10.1016/j.ijforecast.2024.07.003

Abstract

This study investigates the forecasting accuracy of human experts versus large language models (LLMs) in the retail sector, particularly during standard and promotional sales periods. Utilizing a controlled experimental setup with 123 human forecasters and five LLMs—namely, ChatGPT-4, ChatGPT3.5, Bard, Bing, and Llama2—we evaluated forecasting precision through the absolute percentage error. Our analysis centered on the effect of the following factors on forecasters’ performance: the supporting statistical model (baseline and advanced), whether the product was on promotion, and the nature of external impact. The findings indicate that LLMs do not consistently outperform humans in forecasting accuracy and that advanced statistical forecasting models do not uniformly enhance the performance of either human forecasters or LLMs. Both human and LLM forecasters exhibited increased forecasting errors, particularly during promotional periods. Our findings call for careful consideration when integrating LLMs into practical forecasting processes.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Humans vs. large language models: Judgmental forecasting in an era of advanced AI

Abstract

Published Version

Talk to us

Similar Papers

More From: International Journal of Forecasting

Lead the way for us

Journal: International Journal of Forecasting	Publication Date: Oct 1, 2024
License type: cc-by

Similar Papers

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... W Nick Street
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... W Nick Street
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

Large Language Models: A Historical and Sociocultural Perspective.
Eugene Yu Ji
Cognitive science | VOL. 48
Eugene Yu JiEugene Yu Ji
01 Mar 2024
Cognitive science | VOL. 48

Comparative Analysis of Large Language Models and Spine Surgeons in Surgical Decision-Making and Radiological Assessment for Spine Pathologies
Ahmad K Almekkawi ... Carlos A Bagley
World Neurosurgery | VOL. -
Ahmad K Almekkawi, et. al.Ahmad K Almekkawi ... Carlos A Bagley
30 Nov 2024
World Neurosurgery | VOL. -

The Limitations of Large Language Models for Understanding Human Language and Cognition.
Christine Cuskley ... Molly Flaherty
Open mind : discoveries in cognitive science | VOL. 8
Christine Cuskley, et. al.Christine Cuskley ... Molly Flaherty
01 Jan 2024
Open mind : discoveries in cognitive science | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Humans vs. large language models: Judgmental forecasting in an era of advanced AI

Abstract

Published Version

Talk to us

Similar Papers

More From: International Journal of Forecasting