Development of a liver disease-specific large language model chat interface using retrieval-augmented generation.

Jin Ge,Victor Galvez,Steve Sun,Mark J Pletcher,Jennifer C Lai,Oksana Gologorskaya,Joseph Owens,Ki Lai

doi:10.1097/hep.0000000000000834

Abstract

Large language models (LLMs) have significant capabilities in clinical information processing tasks. Commercially available LLMs, however, are not optimized for clinical uses and are prone to generating hallucinatory information. Retrieval-augmented generation (RAG) is an enterprise architecture that allows the embedding of customized data into LLMs. This approach "specializes" the LLMs and is thought to reduce hallucinations. We developed "LiVersa," a liver disease-specific LLM, by using our institution's protected health information-complaint text embedding and LLM platform, "Versa." We conducted RAG on 30 publicly available American Association for the Study of Liver Diseases guidance documents to be incorporated into LiVersa. We evaluated LiVersa's performance by conducting 2 rounds of testing. First, we compared LiVersa's outputs versus those of trainees from a previously published knowledge assessment. LiVersa answered all 10 questions correctly. Second, we asked 15 hepatologists to evaluate the outputs of 10 hepatology topic questions generated by LiVersa, OpenAI's ChatGPT 4, and Meta's Large Language Model Meta AI 2. LiVersa's outputs were more accurate but were rated less comprehensive and safe compared to those of ChatGPT 4. We evaluated LiVersa's performance by conducting 2 rounds of testing. First, we compared LiVersa's outputs versus those of trainees from a previously published knowledge assessment. LiVersa answered all 10 questions correctly. Second, we asked 15 hepatologists to evaluate the outputs of 10 hepatology topic questions generated by LiVersa, OpenAI's ChatGPT 4, and Meta's Large Language Model Meta AI 2. LiVersa's outputs were more accurate but were rated less comprehensive and safe compared to those of ChatGPT 4. In this demonstration, we built disease-specific and protected health information-compliant LLMs using RAG. While LiVersa demonstrated higher accuracy in answering questions related to hepatology, there were some deficiencies due to limitations set by the number of documents used for RAG. LiVersa will likely require further refinement before potential live deployment. The LiVersa prototype, however, is a proof of concept for utilizing RAG to customize LLMs for clinical use cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Development of a liver disease-specific large language model chat interface using retrieval-augmented generation.

Abstract

Talk to us

Similar Papers

More From: Hepatology (Baltimore, Md.)

Lead the way for us

Journal: Hepatology (Baltimore, Md.)	Publication Date: Mar 7, 2024
Citations: 19

Similar Papers

Large language models in neurosurgery: a systematic review and meta-analysis.
Advait Patil ... Kevin T Huang
Acta neurochirurgica | VOL. 166
Advait Patil, et. al.Advait Patil ... Kevin T Huang
23 Nov 2024
Acta neurochirurgica | VOL. 166

Assessing the research landscape and clinical utility of large language models: a scoping review.
Ye-Jean Park ... Christopher Naugler
BMC medical informatics and decision making | VOL. 24
Ye-Jean Park, et. al.Ye-Jean Park ... Christopher Naugler
12 Mar 2024
BMC medical informatics and decision making | VOL. 24

Evaluating the Performance of Large Language Models in Hematopoietic Stem Cell Transplantation Decision Making
Ivan Civettini ... Carlo Gambacorti-Passerini
Blood | VOL. 142
Ivan Civettini, et. al.Ivan Civettini ... Carlo Gambacorti-Passerini
02 Nov 2023
Blood | VOL. 142

Evaluating the Efficacy of AI Chatbots as Tutors in Urology: A Comparative Analysis of Responses to the 2022 In-Service Assessment of the European Board of Urology
Katharina Körner-Riffard ... Sabine D Brookman-May
Urologia Internationalis | VOL. 108
Katharina Körner-Riffard, et. al.Katharina Körner-Riffard ... Sabine D Brookman-May
30 Mar 2024
Urologia Internationalis | VOL. 108

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development of a liver disease-specific large language model chat interface using retrieval-augmented generation.

Abstract

Talk to us

Similar Papers

More From: Hepatology (Baltimore, Md.)