A corpus-driven comparative analysis of AI in academic discourse: Investigating ChatGPT-generated academic texts in social sciences

Giordano Tudino, Yan Qin

doi:10.1016/j.lingua.2024.103838

Abstract

Since its release in 2022, ChatGPT has found widespread application across various disciplines. While previous studies on generative AI’s capabilities have predominantly concentrated on content quality assessments, little attention has been directed toward how the model’s linguistic patterns compare with human-generated language. To address this gap, we built two specialized corpora of academic texts in social sciences generated by ChatGPT-4o mini and selected the Elsevier OA CC-BY Corpus as a reference for comparison. Our aims were to identify commonalities and differences between AI-generated and human academic language and to determine whether academic language instructions improve the formal rigor of the model’s output. The findings revealed limitations in ChatGPT’s handling of academic discourse in three respects: overuse of infrequent “academic” vocabulary, limited use of subordination, and syntactic and semantic homogeneity. In addition, the effect of specific language-oriented prompts is primarily reflected in minor lexical adjustments. This study expands the scope of corpus linguistics research by incorporating AI-generated texts into the analytical framework and lays the groundwork for future improvements in the language model’s genre discrimination.

Similar Papers

Gaplessness in Chinese relative clauses
Chen Li ... Seung-Man Kang
Lingua | VOL. 312
01 Dec 2024

A corpus-driven comparative analysis of AI in academic discourse: Investigating ChatGPT-generated academic texts in social sciences
Giordano Tudino ... Yan Qin
Lingua | VOL. 312
01 Dec 2024

Dependencies between adverbs and sentence-final particles: A case of confirmative and non-confirmative modals in Cantonese
Peppina Po-Lun Lee
Lingua | VOL. 312
01 Dec 2024

Towards a system of principles for identifying nominalizing metaphors
Wen Li ... Bingjun Yang
Lingua | VOL. 312
01 Dec 2024
