Automatic Prediction of Recurrence of Major Cardiovascular Events: A Text Mining Study Using Chest X-Ray Reports.

Ayoub Bagheri,Folkert W Asselbergs,Saskia Haitjema,Wouter B Veldhuis,T Katrien J Groenhof,Pim A De Jong,Michiel L Bots,Daniel L Oberski

doi:10.1155/2021/6663884

Abstract

Methods We used EHR data of patients included in the Second Manifestations of ARTerial disease (SMART) study. We propose a deep learning-based multimodal architecture for our text mining pipeline that integrates neural text representation with preprocessed clinical predictors for the prediction of recurrence of major cardiovascular events in cardiovascular patients. Text preprocessing, including cleaning and stemming, was first applied to filter out the unwanted texts from X-ray radiology reports. Thereafter, text representation methods were used to numerically represent unstructured radiology reports with vectors. Subsequently, these text representation methods were added to prediction models to assess their clinical relevance. In this step, we applied logistic regression, support vector machine (SVM), multilayer perceptron neural network, convolutional neural network, long short-term memory (LSTM), and bidirectional LSTM deep neural network (BiLSTM). Results We performed various experiments to evaluate the added value of the text in the prediction of major cardiovascular events. The two main scenarios were the integration of radiology reports (1) with classical clinical predictors and (2) with only age and sex in the case of unavailable clinical predictors. In total, data of 5603 patients were used with 5-fold cross-validation to train the models. In the first scenario, the multimodal BiLSTM (MI-BiLSTM) model achieved an area under the curve (AUC) of 84.7%, misclassification rate of 14.3%, and F1 score of 83.8%. In this scenario, the SVM model, trained on clinical variables and bag-of-words representation, achieved the lowest misclassification rate of 12.2%. In the case of unavailable clinical predictors, the MI-BiLSTM model trained on radiology reports and demographic (age and sex) variables reached an AUC, F1 score, and misclassification rate of 74.5%, 70.8%, and 20.4%, respectively. Conclusions Using the case study of routine care chest X-ray radiology reports, we demonstrated the clinical relevance of integrating text features and classical predictors in our text mining pipeline for cardiovascular risk prediction. The MI-BiLSTM model with word embedding representation appeared to have a desirable performance when trained on text data integrated with the clinical variables from the SMART study. Our results mined from chest X-ray reports showed that models using text data in addition to laboratory values outperform those using only known clinical predictors.

Highlights

Electronic health records (EHRs) data have become increasingly available to researchers as more hospitals, clinics, and practices have adopted data digitization
EHRs store data in different modalities, such as structured data and unstructured texts. is digitization creates an opportunity to mine the health records to increase the quality of care and clinical outcomes
Patients visiting the University Medical Center (UMC) Utrecht for evaluation of any atherosclerotic cardiovascular condition are eligible for inclusion in Second Manifestations of ARTerial disease (SMART). e inclusion criteria are presenting with an atherosclerotic cardiovascular condition and age >18 years

Summary

Introduction

Electronic health records (EHRs) data have become increasingly available to researchers as more hospitals, clinics, and practices have adopted data digitization. We sought to assess the integration of various text representation approaches and clinical structured data with state-of-the-art deep learning methods in the process of medical text mining. We propose a deep learning-based multimodal architecture for our text mining pipeline that integrates neural text representation with preprocessed clinical predictors for the prediction of recurrence of major cardiovascular events in cardiovascular patients. These text representation methods were added to prediction models to assess their clinical relevance In this step, we applied logistic regression, support vector machine (SVM), multilayer perceptron neural network, convolutional neural network, long short-term memory (LSTM), and bidirectional LSTM deep neural network (BiLSTM). Using the case study of routine care chest X-ray radiology reports, we demonstrated the clinical relevance of integrating text features and classical predictors in our text mining pipeline for cardiovascular risk prediction. Our results mined from chest X-ray reports showed that models using text data in addition to laboratory values outperform those using only known clinical predictors

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of healthcare engineering	Publication Date: Jul 9, 2021
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Automatic Prediction of Recurrence of Major Cardiovascular Events: A Text Mining Study Using Chest X-Ray Reports.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of healthcare engineering

Lead the way for us

Similar Papers

Multimodal Learning for Cardiovascular Risk Prediction using EHR Data
Ayoub Bagheri ... T Katrien J Groenhof
-
Ayoub Bagheri, et. al.Ayoub Bagheri ... T Katrien J Groenhof
21 Sep 2020
21 Sep 2020

VLDL cholesterol associates with higher plasmatic expression of inflammatory proteins and atherosclerotic pathways compared to LDL cholesterol
Alberico L Catapano ... Nick Nurmohamed
European Atherosclerosis Journal | VOL. 2
Alberico L Catapano, et. al.Alberico L Catapano ... Nick Nurmohamed
31 Dec 2024
European Atherosclerosis Journal | VOL. 2

Limited benefit of haemoglobin glycation index as risk factor for cardiovascular disease in type 2 diabetes patients
H.B Østergaard ... J Westerink
Diabetes & Metabolism | VOL. 45
H.B Østergaard, et. al.H.B Østergaard ... J Westerink
08 May 2018
Diabetes & Metabolism | VOL. 45

Automated Classification of Free-Text Radiology Reports: Using Different Feature Extraction Methods to Identify Fractures of the Distal Fibula.
Frank K Wacker ... Jan B Hinrichs
RöFo - Fortschritte auf dem Gebiet der Röntgenstrahlen und der bildgebenden Verfahren | VOL. 195
Frank K Wacker, et. al.Frank K Wacker ... Jan B Hinrichs
09 May 2023
RöFo - Fortschritte auf dem Gebiet der Röntgenstrahlen und der bildgebenden Verfahren | VOL. 195

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Prediction of Recurrence of Major Cardiovascular Events: A Text Mining Study Using Chest X-Ray Reports.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of healthcare engineering