Performance of an Open-Source Large Language Model in Extracting Information from Free-Text Radiology Reports.

Bastien Le Guellec,Alexandre Lefèvre,Charlotte Geay,Lucas Shorten,Cyril Bruge,Lotfi Hacein-Bey,Philippe Amouyel,Jean-Pierre Pruvo,Gregory Kuchcinski,Aghiles Hamroun

doi:10.1148/ryai.230364

Abstract

Purpose To assess the performance of a local open-source large language model (LLM) in various information extraction tasks from real-life emergency brain MRI reports. Materials and Methods All consecutive emergency brain MRI reports written in 2022 from a French quaternary center were retrospectively reviewed. Two radiologists identified MRI scans that were performed in the emergency department for headaches. Four radiologists scored the reports' conclusions as either normal or abnormal. Abnormalities were labeled as either headache-causing or incidental. Vicuna (LMSYS Org), an open-source LLM, performed the same tasks. Vicuna's performance metrics were evaluated using the radiologists' consensus as the reference standard. Results Among the 2398 reports during the study period, radiologists identified 595 that included headaches in the indication (median age of patients, 35 years [IQR, 26-51 years]; 68% [403 of 595] women). A positive finding was reported in 227 of 595 (38%) cases, 136 of which could explain the headache. The LLM had a sensitivity of 98.0% (95% CI: 96.5, 99.0) and specificity of 99.3% (95% CI: 98.8, 99.7) for detecting the presence of headache in the clinical context, a sensitivity of 99.4% (95% CI: 98.3, 99.9) and specificity of 98.6% (95% CI: 92.2, 100.0) for the use of contrast medium injection, a sensitivity of 96.0% (95% CI: 92.5, 98.2) and specificity of 98.9% (95% CI: 97.2, 99.7) for study categorization as either normal or abnormal, and a sensitivity of 88.2% (95% CI: 81.6, 93.1) and specificity of 73% (95% CI: 62, 81) for causal inference between MRI findings and headache. Conclusion An open-source LLM was able to extract information from free-text radiology reports with excellent accuracy without requiring further training. Keywords: Large Language Model (LLM), Generative Pretrained Transformers (GPT), Open Source, Information Extraction, Report, Brain, MRI Supplemental material is available for this article. Published under a CC BY 4.0 license. See also the commentary by Akinci D'Antonoli and Bluethgen in this issue.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance of an Open-Source Large Language Model in Extracting Information from Free-Text Radiology Reports.

Abstract

Talk to us

Similar Papers

More From: Radiology. Artificial intelligence

Lead the way for us

Journal: Radiology. Artificial intelligence	Publication Date: May 8, 2024
Citations: 7

Similar Papers

Performance of Large Language Models on a Neurology Board–Style Examination
Marc Cicero Schubert ... Varun Venkataramani
JAMA network open | VOL. 6
Marc Cicero Schubert, et. al.Marc Cicero Schubert ... Varun Venkataramani
07 Dec 2023
JAMA network open | VOL. 6

Large language model-based information extraction from free-text radiology reports: a scoping review protocol
Daniel Reichenpfader ... Kerstin Denecke
BMJ Open | VOL. 13
Daniel Reichenpfader, et. al.Daniel Reichenpfader ... Kerstin Denecke
01 Dec 2023
BMJ Open | VOL. 13

Evaluating the Performance of Large Language Models in Hematopoietic Stem Cell Transplantation Decision Making
Ivan Civettini ... Paola Perfetti
Blood | VOL. 142
Ivan Civettini, et. al.Ivan Civettini ... Paola Perfetti
02 Nov 2023
Blood | VOL. 142

Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting.
... Fung Fung Lee
Journal of the American Medical Informatics Association | VOL. 30
, et. al. ... Fung Fung Lee
14 Jul 2023
Journal of the American Medical Informatics Association | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance of an Open-Source Large Language Model in Extracting Information from Free-Text Radiology Reports.

Abstract

Talk to us

Similar Papers

More From: Radiology. Artificial intelligence