Assessing the potential of LLM-assisted annotation for corpus-based pragmatics and discourse analysis

Danni Yu, Matteo Fuoli, Hang Su, Luyang Li

doi:10.1075/ijcl.23087.yu

Abstract

Certain forms of linguistic annotation, like part-of-speech and semantic tagging, can be automated with high accuracy. However, manual annotation is still necessary for complex pragmatic and discursive features that lack a direct mapping to lexical forms. This manual process is time-consuming and error-prone, limiting the scalability of function-to-form approaches in corpus linguistics. To address this, our study explores the possibility of using large language models (LLMs) to automate pragma-discursive corpus annotation. We compare GPT-3.5 (the model behind the free-to-use version of ChatGPT), GPT-4 (the model underpinning the precise mode of the Bing chatbot), and a human coder in annotating apology components in English based on the local grammar framework. We find that GPT-4 outperformed GPT-3.5, with accuracy approaching that of a human coder. These results suggest that LLMs can be successfully deployed to aid pragma-discursive corpus annotation, making the process more efficient, scalable, and accessible.

Journal: International Journal of Corpus Linguistics
Publication Date: Jun 3, 2024
Citations: 1

Similar Papers

An Examination of the Use of Large Language Models to Aid Analysis of Textual Data
Robert H Tai ... Barnas G Monteith
International Journal of Qualitative Methods | VOL. 23
01 Jan 2024

Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review.
Leyao Wang ... Zhijun Yin
Journal of Medical Internet Research | VOL. 26
07 Nov 2024

Large Language Models Outperform Expert Coders and Supervised Classifiers at Annotating Political Social Media Messages
Petter Törnberg
Social Science Computer Review | VOL. -
22 Sep 2024

Do AIs know what the most important issue is? Using language models to code open-text social survey responses at scale
Jonathan Mellon ... Ralph Scott
Research & Politics | VOL. 11
01 Jan 2024
