AI hallucination: towards a comprehensive classification of distorted information in artificial intelligence-generated content

Yujie Sun,Dongfang Sheng,Zihan Zhou,Yifei Wu

doi:10.1057/s41599-024-03811-x

Abstract

Amidst the burgeoning information age, the rapid development of artificial intelligence-generated content (AIGC) has brought forth challenges regarding information authenticity. The proliferation of distorted information significantly impacts users negatively. This study aims to systematically categorize distorted information within AIGC, delve into its internal characteristics, and provide theoretical guidance for its management. Utilizing ChatGPT as a case study, we conducted empirical content analysis on 243 instances of distorted information collected, comprising both questions and answers. Three coders meticulously interpreted each instance of distorted information, encoding error points based on a predefined coding scheme and categorizing them according to error type. Our objective was to refine and validate the distorted information category list derived from the review through multiple rounds of pre-coding and test coding, thereby yielding a comprehensive and clearly delineated category list of distorted information in AIGC. The findings identified 8 first-level error types: “Overfitting”; “Logic errors”; “Reasoning errors”; “Mathematical errors”; “Unfounded fabrication”; “Factual errors”; “Text output errors”; and “Other errors”, further subdivided into 31 second-level error types. This classification list not only lays a solid foundation for studying risks associated with AIGC but also holds significant practical implications for helping users identify distorted information and enabling developers to enhance the quality of AI-generated tools.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AI hallucination: towards a comprehensive classification of distorted information in artificial intelligence-generated content

Abstract

Talk to us

Similar Papers

More From: Humanities and Social Sciences Communications

Lead the way for us

Journal: Humanities and Social Sciences Communications	Publication Date: Sep 27, 2024
License type: cc-by-nc-nd

Similar Papers

The numbers game: A case study of mathematical literacy at a South African newspaper1
Robert Brand*
Communicatio | VOL. 34
Robert Brand*Robert Brand*
01 Jan 1970
Communicatio | VOL. 34

ERROR ANALYSIS IN SOLVING STRAIGHT MOTIONS' PROBLEMS OF SENIOR HIGH SCHOOL STUDENTS IN PEKANBARU
Vetty Vellancia ... Hendar Sudrajad
Jurnal Geliga Sains: Jurnal Pendidikan Fisika | VOL. 7
Vetty Vellancia, et. al.Vetty Vellancia ... Hendar Sudrajad
18 Apr 2020
Jurnal Geliga Sains: Jurnal Pendidikan Fisika | VOL. 7

Heuristic errors in clinical reasoning.
Melanie Rylander ... Jeannette Guerrasio
The Clinical Teacher | VOL. 13
Melanie Rylander, et. al.Melanie Rylander ... Jeannette Guerrasio
23 Sep 2015
The Clinical Teacher | VOL. 13

KESALAHAN BERBAHASA INDONESIA PADA SURAT DINAS DESA BATAN SEBAGAI MATERI AJAR BAHASA INDONESIA DI SMP
...
-
, et. al. ...
12 Dec 2019
12 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AI hallucination: towards a comprehensive classification of distorted information in artificial intelligence-generated content

Abstract

Talk to us

Similar Papers

More From: Humanities and Social Sciences Communications