Abstract

INTRODUCTION: Existing clinical prediction algorithms mostly leverage small cohorts of structured data (e.g., medical imaging or laboratory values). Large language models, however, have demonstrated the ability to exploit unstructured data and, given sufficient data, outperform other machine learning approaches. Training large language models on unstructured clinical notes therefore offers an alternative to structured-data algorithm development for clinical tasks. METHODS: An unlabeled dataset of over seven million unstructured clinical notes (e.g., radiology reports and patient histories) was collected from four hospitals within the NYU Langone Health (NYULH) system and used to pre-train a bidirectional encoder representations from transformers (BERT) model. This model was then fine-tuned on a labeled dataset of discharge summaries to predict 30-day all-cause readmission. The resulting model, termed NYUTron, was assessed on a held-out retrospective cohort of patients from June to December 2021. RESULTS: Over the period of the retrospective study, there were a total of 1,072 neurosurgery patients. NYUTron achieved an area under the receiver operating characteristic curve (AUROC) of 0.7883, a recall of 95.1% at a precision of 27.3%, and an accuracy of 82.1%. CONCLUSIONS: This study demonstrates how large language models and unstructured clinical notes can inform physicians on clinical tasks, within a flexible framework that is amenable to modification for other clinical tasks.
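The evaluation metrics reported above (AUROC, recall at a given precision, accuracy at a threshold) can be computed directly from a model's predicted readmission probabilities and the held-out labels. The sketch below, using only NumPy on synthetic stand-in data (the labels and scores here are illustrative, not the study's cohort or NYUTron's outputs), shows one minimal way to compute them:

```python
import numpy as np

def auroc(y_true, y_score):
    """AUROC as the probability a random positive outscores a random negative
    (Mann-Whitney identity); pairwise comparison, with ties counted half."""
    y_true = np.asarray(y_true, dtype=bool)
    pos = np.asarray(y_score)[y_true]
    neg = np.asarray(y_score)[~y_true]
    greater = (pos[:, None] > neg[None, :]).mean()
    ties = (pos[:, None] == neg[None, :]).mean()
    return greater + 0.5 * ties

def recall_precision_at_threshold(y_true, y_score, t):
    """Recall and precision when scores >= t are flagged as predicted readmissions."""
    y_true = np.asarray(y_true, dtype=bool)
    pred = np.asarray(y_score) >= t
    tp = (pred & y_true).sum()
    recall = tp / y_true.sum()
    precision = tp / pred.sum() if pred.sum() else 0.0
    return recall, precision

# Illustrative synthetic labels (1 = readmitted within 30 days) and scores.
rng = np.random.default_rng(0)
y = rng.integers(0, 2, 200)
s = np.clip(0.4 * y + 0.6 * rng.random(200), 0.0, 1.0)

print("AUROC:", auroc(y, s))  # 1.0 would be perfect ranking, ~0.5 is chance
print("recall/precision at 0.5:", recall_precision_at_threshold(y, s, 0.5))
```

In practice the threshold would be chosen on a validation set to hit an operating point such as the high-recall setting reported in the abstract, where catching most true readmissions matters more than the false-alarm rate.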
