Abstract

Artificial intelligence (AI) technology can enable more efficient decision-making in healthcare, and there is growing interest in improving the speed and accuracy with which AI systems respond to clinical tasks. This study aimed to assess the reliability of ChatGPT in determining emergency department (ED) triage levels using the Korean Triage and Acuity Scale (KTAS). Two hundred and two virtual patient cases were constructed, and the gold standard triage classification for each case was established by an experienced ED physician. Three additional human raters (ED paramedics) rated the virtual cases independently. The cases were also rated by two versions of the chat generative pre-trained transformer (ChatGPT 3.5 and 4.0). Inter-rater reliability was examined using Fleiss' kappa and the intra-class correlation coefficient (ICC). The kappa values for agreement between the four human raters and ChatGPT were .523 (version 4.0) and .320 (version 3.5). Of the five KTAS levels, performance was poorest for patients at levels 1 and 5 and for case scenarios containing additional free-text descriptions. Accuracy also differed between the two versions: the ICC between version 3.5 and the gold standard was .520, whereas that between version 4.0 and the gold standard was .802. A substantial level of inter-rater reliability was found when GPTs were used as KTAS raters. These findings show the potential of GPT in emergency healthcare settings; given the shortage of experienced personnel, this AI method may help improve triage accuracy.
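
For illustration, the following is a minimal Python sketch (not the authors' code) of how the two reliability statistics reported above could be computed, assuming the ratings are stored as a 202 × 4 array of KTAS levels. The simulated data and variable names are hypothetical placeholders for the study's actual ratings.

```python
# Minimal sketch of the reliability statistics used in the study:
# Fleiss' kappa across raters and ICC(2,1) against a gold standard.
# The random ratings below are placeholders, not the study's data.
import numpy as np
from statsmodels.stats.inter_rater import aggregate_raters, fleiss_kappa

rng = np.random.default_rng(0)
n_cases, n_raters = 202, 4                                # 202 virtual cases, 4 human raters
ratings = rng.integers(1, 6, size=(n_cases, n_raters))    # placeholder KTAS levels 1-5

# Fleiss' kappa: convert raw ratings to per-case category counts first.
counts, _ = aggregate_raters(ratings)
kappa = fleiss_kappa(counts, method="fleiss")
print(f"Fleiss' kappa: {kappa:.3f}")

# ICC(2,1): two-way random effects, absolute agreement, single rater,
# computed from the ANOVA mean squares (Shrout & Fleiss formulation).
def icc2_1(x):
    n, k = x.shape
    grand = x.mean()
    ms_rows = k * ((x.mean(axis=1) - grand) ** 2).sum() / (n - 1)
    ms_cols = n * ((x.mean(axis=0) - grand) ** 2).sum() / (k - 1)
    ss_err = ((x - x.mean(axis=1, keepdims=True)
                 - x.mean(axis=0, keepdims=True) + grand) ** 2).sum()
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err
                                 + k * (ms_cols - ms_err) / n)

gold = ratings[:, 0]                     # stand-in for the physician's gold standard
gpt = rng.integers(1, 6, size=n_cases)   # stand-in for one GPT rater
print(f"ICC(2,1): {icc2_1(np.column_stack([gold, gpt])):.3f}")
```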
