Advancement of Generative Pre-trained Transformer Chatbots in Answering Clinical Questions in the Practical Rhinoplasty Guideline.

Makoto Shiraishi,Saori Tsuruda,Yoko Tomioka,Jinwoo Chang,Asei Hori,Saaya Ishii,Rei Fujinaka,Taku Ando,Jun Ohba,Mutsumi Okazaki

doi:10.1007/s00266-024-04377-4

Abstract

The Generative Pre-trained Transformer (GPT) series, which includes ChatGPT, is an artificial large language model that provides human-like text dialogue. This study aimed to evaluate the performance of artificial intelligence chatbots in answering clinical questions based on practical rhinoplasty guidelines. Clinical questions (CQs) developed from the guidelines were used as question sources. For each question, we asked GPT-4 and GPT-3.5 (ChatGPT), developed by OpenAI, to provide answers for the CQs, Policy Level, Aggregate Evidence Quality, Level of Confidence in Evidence, and References. We compared the performance of the two types of artificial intelligence (AI) chatbots. A total of 10 questions were included in the final analysis, and the AI chatbots correctly answered 90.0% of these. GPT-4 demonstrated a lower accuracy rate than GPT-3.5 in answering CQs, although without statistically significant difference (86.0% vs. 94.0%; p = 0.05), whereas GPT-4 showed significantly higher accuracy for the level of confidence in Evidence than GPT-3.5 (52.0% vs. 28.0%; p < 0.01). No statistical differences were observed in Policy Level, Aggregate Evidence Quality, and Reference Match. In addition, GPT-4 rated significantly higher in presenting existing references than GPT-3.5 (36.9% vs. 24.1%; p = 0.01). The overall performance of GPT-4 was similar to that of GPT-3.5. However, GPT-4 provided existing references at a higher rate than GPT-3.5. GPT-4 has the potential to provide a more accurate reference in professional fields, including rhinoplasty. This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266 .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Advancement of Generative Pre-trained Transformer Chatbots in Answering Clinical Questions in the Practical Rhinoplasty Guideline.

Abstract

Talk to us

Similar Papers

More From: Aesthetic plastic surgery

Lead the way for us

Similar Papers

Appropriateness of Artificial Intelligence Chatbots in Diabetic Foot Ulcer Management.
Makoto Shiraishi ... Haesu Lee
The International Journal of Lower Extremity Wounds | VOL. -
Makoto Shiraishi, et. al.Makoto Shiraishi ... Haesu Lee
28 Feb 2024
The International Journal of Lower Extremity Wounds | VOL. -

Evaluation and Comparison of Ophthalmic Scientific Abstracts and References by Current Artificial Intelligence Chatbots
Hong-Uyen Hua ... Danny A Mammo
JAMA ophthalmology | VOL. 141
Hong-Uyen Hua, et. al.Hong-Uyen Hua ... Danny A Mammo
27 Jul 2023
JAMA ophthalmology | VOL. 141

How Well Do Artificial Intelligence Chatbots Respond to the Top Search Queries About Urological Malignancies?
David Musheyev ... Abdo E Kabarriti
European urology | VOL. 85
David Musheyev, et. al.David Musheyev ... Abdo E Kabarriti
10 Aug 2023
European urology | VOL. 85

Do AI chatbots improve students learning outcomes? Evidence from a meta‐analysis
Rong Wu ... Zhonggen Yu
British Journal of Educational Technology | VOL. 55
Rong Wu, et. al.Rong Wu ... Zhonggen Yu
03 May 2023
British Journal of Educational Technology | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Advancement of Generative Pre-trained Transformer Chatbots in Answering Clinical Questions in the Practical Rhinoplasty Guideline.

Abstract

Talk to us

Similar Papers

More From: Aesthetic plastic surgery