Enhancing chatbot performance for imaging recommendations: Leveraging GPT-4 and context-awareness for trustworthy clinical guidance

Alexander Rau, Fabian Bamberg, Anna Fink, Phuong Hien Tran, Marco Reisert, Maximilian F Russe

doi:10.1016/j.ejrad.2024.111756

Abstract

Purpose: To investigate whether GPT-4 improves the accuracy, consistency, and trustworthiness of a context-aware chatbot that provides personalized imaging recommendations from American College of Radiology (ACR) appropriateness criteria documents using semantic similarity processing. In addition, we sought to enable auditability of the output by revealing the information source on which each decision relies.

Material and Methods: We refined an existing chatbot that incorporated specialized knowledge of the ACR guidelines by upgrading from GPT-3.5-Turbo to its successor, OpenAI's GPT-4, using the latest version of LlamaIndex, and improving the prompting strategy. This chatbot was compared with the previous version, generic GPT-3.5-Turbo and GPT-4, and general radiologists regarding performance in applying the ACR appropriateness guidelines.

Results: The refined context-aware chatbot outperformed the previous version using GPT-3.5-Turbo, the generic chatbots GPT-3.5-Turbo and GPT-4, and general radiologists in providing “usually or may be appropriate” recommendations according to the ACR guidelines (all p < 0.001). It also outperformed GPT-3.5-Turbo and general radiologists with respect to “usually appropriate” recommendations (both p < 0.001). Moreover, consistency in correct answers was higher, with 78 % consistent correct “usually appropriate” answers and 94 % for “usually or may be appropriate” recommendations. In all cases, the same source documents were chosen, ensuring transparency.

Conclusion: Our study demonstrates the significance of context awareness in ensuring the use of appropriate knowledge and proposes a strategy to enhance trust in chatbot-based outputs by providing transparency. The improvements in accuracy, consistency, and source transparency address trust issues and enhance the clinical decision support process.

Abbreviations: ACR, American College of Radiology; accGPT, appropriateness criteria context aware GPT; accGPT-4, appropriateness criteria context aware GPT using GPT-4; GPT, generative pre-trained transformer; LLM, Large Language Model.
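
The abstract describes selecting guideline documents by semantic similarity and revealing which source a recommendation relies on. The following is a minimal sketch of that general idea only, not the authors' implementation and not the LlamaIndex API. It assumes a toy word-count vector in place of a learned embedding model, and the document names and snippets are hypothetical placeholders rather than ACR guideline text.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': lowercase word counts (an assumed stand-in for a learned model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors; 0.0 if either is empty."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, top_k=1):
    """Return the top_k (source_name, score) pairs, most similar first."""
    q = embed(query)
    scored = [(name, cosine(q, embed(text))) for name, text in documents.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical placeholder snippets; NOT actual ACR guideline text.
documents = {
    "placeholder_headache.txt": "imaging options for adult patients presenting with sudden severe headache",
    "placeholder_knee.txt": "imaging options for adult patients with acute knee trauma after a fall",
}

query = "sudden severe headache in an adult"
best_source, score = retrieve(query, documents)[0]
# Reporting the source name alongside the answer is what makes the output auditable.
print(best_source, round(score, 3))
```

Returning the source name together with the similarity score lets a reader check which document the answer was drawn from, which is the kind of auditability the abstract aims for.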

Journal: European Journal of Radiology
Publication Date: Sep 24, 2024
License type: cc-by

Similar Papers

The utilization of computed tomography in the emergency department and appropriateness as determined by the American College of Radiology appropriateness criteria: 2000 versus 2003
D.J Peter ... J.E Duldner
25 Sep 2004
Annals of Emergency Medicine | VOL. 44

A Context-based Chatbot Surpasses Trained Radiologists and Generic ChatGPT in Following the ACR Appropriateness Guidelines.
Alexander Rau ... Hien Tran
01 Jul 2023
Radiology | VOL. 308

Comparison of professional medical society guidelines for appropriate use of coronary computed tomography angiography
Eileen Hu-Wang ... Marcus Y Chen
03 Feb 2020
Journal of Cardiovascular Computed Tomography | VOL. 14

Comparison of Various Ultrasound-Based Malignant Risk Stratification Systems on an Occasion for Assessing Thyroid Nodules in Hashimoto's Thyroiditis.
Tianxue Zhao ... Shaokun Xu
01 Feb 2023
International Journal of General Medicine | VOL. 16
