Stroke mortality prediction based on ensemble learning and the combination of structured and textual data

Ruixuan Huang,Jundong Liu,Tsz Kin Wan,Damrongrat Siriwanna,Yat Ming Peter Woo,Asmir Vodencarevic,Chi Wah Wong,Kei Hang Katie Chan

doi:10.1016/j.compbiomed.2022.106176

Abstract

For severe cerebrovascular diseases such as stroke, the prediction of short-term mortality of patients has tremendous medical significance. In this study, we combined machine learning models Random Forest classifier (RF), Adaptive Boosting (AdaBoost), Extremely Randomised Trees (ExtraTree) classifier, XGBoost classifier, TabNet, and DistilBERT to construct a multi-level prediction model that used bioassay data and radiology text reports from haemorrhagic and ischaemic stroke patients to predict six-month mortality. The performances of the prediction models were measured using the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve (AUPRC), precision, recall, and F1-score. The prediction models were built with the use of data from 19,616 haemorrhagic stroke patients and 50,178 ischaemic stroke patients. Novel six-month mortality prediction models for these patients were developed, which enhanced the performance of the prediction models by combining laboratory test data, structured data, and textual radiology report data. The achieved performances were as follows: AUROC = 0.89, AUPRC = 0.70, precision = 0.52, recall = 0.78, and F1 score = 0.63 for haemorrhagic patients, and AUROC = 0.88, AUPRC = 0.54, precision = 0.34, recall = 0.80, and F1 score = 0.48 for ischaemic patients. Such models could be used for mortality risk assessment and early identification of high-risk stroke patients. This could contribute to more efficient utilisation of healthcare resources for stroke survivors.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stroke mortality prediction based on ensemble learning and the combination of structured and textual data

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine

Lead the way for us

Journal: Computers in Biology and Medicine	Publication Date: Oct 28, 2022
Citations: 11

Similar Papers

Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques
Jun Li ... Jiatuo Xu
International Journal of Medical Informatics | VOL. 149
Jun Li, et. al.Jun Li ... Jiatuo Xu
22 Feb 2021
International Journal of Medical Informatics | VOL. 149

Pediatric ECG-Based Deep Learning to Predict Left Ventricular Dysfunction and Remodeling.
Akhil Vaid ... William G La Cava
Circulation | VOL. 149
Akhil Vaid, et. al.Akhil Vaid ... William G La Cava
05 Feb 2024
Circulation | VOL. 149

The performance of VCS(volume, conductivity, light scatter) parameters in distinguishing latent tuberculosis and active tuberculosis by using machine learning algorithm
Lijiao Chen ... Shaoli Deng
BMC Infectious Diseases | VOL. 23
Lijiao Chen, et. al.Lijiao Chen ... Shaoli Deng
16 Dec 2023
BMC Infectious Diseases | VOL. 23

Machine Learning Electronic Health Record Identification of Patients with Rheumatoid Arthritis: Algorithm Pipeline Development and Validation Study.
Tjardo D Maarseveen ... Erik B Van Den Akker
JMIR Medical Informatics | VOL. 8
Tjardo D Maarseveen, et. al.Tjardo D Maarseveen ... Erik B Van Den Akker
30 Nov 2020
JMIR Medical Informatics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stroke mortality prediction based on ensemble learning and the combination of structured and textual data

Abstract

Talk to us

Similar Papers

More From: Computers in Biology and Medicine