Abstract
This study aimed to investigate the efficacy of fine-tuned large language models (LLMs) in classifying brain MRI reports into pretreatment, posttreatment, and nontumor cases. This retrospective study included 759, 284, and 164 brain MRI reports for the training, validation, and test datasets, respectively. Radiologists stratified the reports into three groups: nontumor (group 1), posttreatment tumor (group 2), and pretreatment tumor (group 3) cases. A pretrained Bidirectional Encoder Representations from Transformers (BERT) Japanese model was fine-tuned on the training dataset and evaluated on the validation dataset. The model that demonstrated the highest accuracy on the validation dataset was selected as the final model. Two additional radiologists classified the reports in the test dataset into the three groups. The model's performance on the test dataset was compared with that of the two radiologists. The fine-tuned LLM attained an overall accuracy of 0.970 (95% CI: 0.930-0.990). The model's sensitivity for groups 1/2/3 was 1.000/0.864/0.978, and its specificity for groups 1/2/3 was 0.991/0.993/0.958. No statistically significant differences were found in accuracy, sensitivity, or specificity between the LLM and the human readers (p ≥ 0.371). The LLM completed the classification task approximately 20-26-fold faster than the radiologists. The area under the receiver operating characteristic curve was 0.994 (95% CI: 0.982-1.000) for discriminating groups 2 and 3 from group 1, and 0.992 (95% CI: 0.982-1.000) for discriminating group 3 from groups 1 and 2. The fine-tuned LLM demonstrated performance comparable to that of the radiologists in classifying brain MRI reports, while requiring substantially less time.
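The abstract does not specify the training pipeline, so the following is a minimal sketch of how a pretrained Japanese BERT model could be fine-tuned for three-class report classification with the Hugging Face transformers library, selecting the checkpoint with the highest validation accuracy. The model name (cl-tohoku/bert-base-japanese), hyperparameters, and in-memory example data are illustrative assumptions, not the authors' exact setup.

```python
# Sketch: fine-tune a Japanese BERT model for 3-class brain MRI report classification.
# Requires: transformers, datasets, and the Japanese tokenizer dependencies
# (fugashi, unidic-lite). All hyperparameters below are assumed values.
import numpy as np
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

MODEL_NAME = "cl-tohoku/bert-base-japanese"  # assumed pretrained checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME,
    num_labels=3,  # 0 = nontumor, 1 = posttreatment tumor, 2 = pretreatment tumor
)

def tokenize(batch):
    # Truncate/pad report text to BERT's 512-token input limit.
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=512)

# Hypothetical placeholder datasets; in practice these would hold the
# 759 training and 284 validation reports described in the abstract.
train_ds = Dataset.from_dict({"text": ["..."], "label": [0]}).map(tokenize, batched=True)
val_ds = Dataset.from_dict({"text": ["..."], "label": [0]}).map(tokenize, batched=True)

def compute_metrics(eval_pred):
    # Overall accuracy on the validation set, used for model selection.
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return {"accuracy": float((preds == labels).mean())}

args = TrainingArguments(
    output_dir="bert-mri-report-classifier",
    eval_strategy="epoch",            # "evaluation_strategy" in older transformers versions
    save_strategy="epoch",
    load_best_model_at_end=True,      # keep the checkpoint with the best validation metric
    metric_for_best_model="accuracy",
    num_train_epochs=5,               # assumed value
    per_device_train_batch_size=16,   # assumed value
)

trainer = Trainer(model=model, args=args, train_dataset=train_ds,
                  eval_dataset=val_ds, compute_metrics=compute_metrics)
trainer.train()
```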