Inferring cancer disease response from radiology reports using large language models with data augmentation and prompting.

Ryan Ying Cong Tan, Han Jieh Tey, Qian Lin, Iain Bee Huat Tan, Daniel Shao Weng Tan, Wei Chong Tan, Lionel Tim-Ee Cheng, Wen Yee Chay, Hong Qi Tan, Guat Hwa Low, Chin Sung Chang, Gillianne Lai, Fun Loon Leong, Hwee Tou Ng, Melvin L.K. Chua, Ruixi Lin, Fuh Yong Wong, Choon Hua Thng, David Tai, W Y Chan, Wen Long Nei, Terence Goh, Fung Fung Lee

doi:10.1093/jamia/ocad133

Abstract

This study assessed the ability of large language models to accurately infer cancer disease response from free-text radiology reports. We assembled 10,602 computed tomography reports from cancer patients seen at a single institution. Each report was classified into one of four categories: no evidence of disease, partial response, stable disease, or progressive disease. We applied transformer models, a bidirectional long short-term memory model, a convolutional neural network model, and conventional machine learning methods to this task. Data augmentation using sentence permutation with consistency loss, as well as prompt-based fine-tuning, was applied to the best-performing models. Models were validated on a hold-out test set and on an external validation set based on Response Evaluation Criteria in Solid Tumors (RECIST) classifications. The best-performing model was the GatorTron transformer, which achieved an accuracy of 0.8916 on the test set and 0.8919 on the RECIST validation set. Data augmentation further improved the accuracy to 0.8976. Prompt-based fine-tuning did not further improve accuracy, but it allowed the number of training reports to be reduced to 500 while still achieving good performance. These models could be used by researchers to derive progression-free survival in large datasets. They may also serve as a decision support tool by providing clinicians with an automated second opinion on disease response. Large clinical language models demonstrate potential to infer cancer disease response from radiology reports at scale. Data augmentation techniques are useful for further improving performance, and prompt-based fine-tuning can substantially reduce the size of the training dataset required.
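As an illustration of the augmentation idea named in the abstract, the following is a minimal sketch of sentence permutation combined with a consistency loss. The abstract does not state how the consistency loss is defined, so the symmetric KL divergence used here is an assumption, and the function names (`permute_sentences`, `consistency_loss`, `total_loss`) and the `weight` parameter are hypothetical. The logits stand in for the outputs of any four-class classifier; no actual language model is invoked.

```python
import math
import random
import re

# The four response categories listed in the abstract.
LABELS = [
    "no evidence of disease",
    "partial response",
    "stable disease",
    "progressive disease",
]


def permute_sentences(report: str, rng: random.Random) -> str:
    """Return the report with its sentences in shuffled order."""
    # Naive split on whitespace that follows sentence-ending punctuation.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", report) if s.strip()]
    rng.shuffle(sentences)
    return " ".join(sentences)


def softmax(logits):
    """Convert raw class scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def consistency_loss(logits_original, logits_permuted):
    """Symmetric KL between predictions for a report and its permutation.

    Assumption: the source does not specify the form of the loss.
    """
    p = softmax(logits_original)
    q = softmax(logits_permuted)
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))


def total_loss(logits_original, logits_permuted, label_index, weight=1.0):
    """Cross-entropy on the original report plus a weighted consistency term."""
    cross_entropy = -math.log(softmax(logits_original)[label_index])
    return cross_entropy + weight * consistency_loss(logits_original, logits_permuted)
```

In a training loop, each report would be passed through the classifier twice, once as written and once after `permute_sentences`, and `total_loss` would penalize the model when the two predictions disagree. The intuition is that the disease response label should not depend on the order in which findings are listed.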


Journal: Journal of the American Medical Informatics Association	Publication Date: Jul 14, 2023
Citations: 7	License type: CC BY 4.0


