Neural networks versus Logistic regression for 30\u2009days all-cause readmission prediction

Ahmed Allam,Michael Krauthammer,George Thoma,Mate Nagy

doi:10.1038/s41598-019-45685-z

Abstract

Heart failure (HF) is one of the leading causes of hospital admissions in the US. Readmission within 30 days after a HF hospitalization is both a recognized indicator for disease progression and a source of considerable financial burden to the healthcare system. Consequently, the identification of patients at risk for readmission is a key step in improving disease management and patient outcome. In this work, we used a large administrative claims dataset to (1) explore the systematic application of neural network-based models versus logistic regression for predicting 30 days all-cause readmission after discharge from a HF admission, and (2) to examine the additive value of patients’ hospitalization timelines on prediction performance. Based on data from 272,778 (49% female) patients with a mean (SD) age of 73 years (14) and 343,328 HF admissions (67% of total admissions), we trained and tested our predictive readmission models following a stratified 5-fold cross-validation scheme. Among the deep learning approaches, a recurrent neural network (RNN) combined with conditional random fields (CRF) model (RNNCRF) achieved the best performance in readmission prediction with 0.642 AUC (95% CI, 0.640–0.645). Other models, such as those based on RNN, convolutional neural networks and CRF alone had lower performance, with a non-timeline based model (MLP) performing worst. A competitive model based on logistic regression with LASSO achieved a performance of 0.643 AUC (95% CI, 0.640–0.646). We conclude that data from patient timelines improve 30 day readmission prediction, that a logistic regression with LASSO has equal performance to the best neural network model and that the use of administrative data result in competitive performance compared to published approaches based on richer clinical datasets.

Highlights

Heart failure (HF) is one of the leading causes for hospital admissions in the US1–4 with high numbers of readmissions within 30 days of discharge[2,3,4]
Starting from recurrent neural networks (RNN), the models trained with loss functions incorporating/emphasizing the loss from last HF event (i.e LastHF and Convex_HF_LastHF) achieved higher performance 0.636 and 0.635 area under the ROC curve (AUC) respectively compared to other loss function definitions
For the best neural model (RNNCRF), we report the analysis of feature importance using a similar approach to the one in[12]

Summary

Introduction

Heart failure (HF) is one of the leading causes for hospital admissions in the US1–4 with high numbers of readmissions within 30 days of discharge[2,3,4]. One specific aim of this study is to examine the value of including a patient’s trajectory data in a 30 day readmission prediction model To this end, we examine three approaches for modeling the problem of which two use the temporal information encoded in the patients’ trajectories (sequence labeling and sequence classification), and one that does not (index event classification). We implemented multiple neural network models with varying architectures and objective functions such as recurrent neural networks (RNN), and convolutional neural networks (CNN) as examples of sequence labeling and classification approaches, and multilayer perceptron (MLP) along with logistic regression as baseline models representing the index event classification approach We conducted these studies with a large administrative claims dataset, which lacks the detailed clinical information found in datasets typically used for this problem. As claims data are readily available and can be robustly harmonized, they pose less privacy concerns and are ideally suited for tacking the HF readmission problem

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Jun 26, 2019
Citations: 55	License type: open-access

R Discovery Prime

R Discovery Prime

Neural networks versus Logistic regression for 30\u2009days all-cause readmission prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Measuring Quality in Heart Failure
Robert O Bonow
Circulation: Cardiovascular Quality and Outcomes | VOL. 1
Robert O BonowRobert O Bonow
01 Sep 2008
Circulation: Cardiovascular Quality and Outcomes | VOL. 1

Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach.
Ryan G Short ... Dave Bogaty
Journal of digital imaging | VOL. 32
Ryan G Short, et. al.Ryan G Short ... Dave Bogaty
18 Oct 2018
Journal of digital imaging | VOL. 32

Applying data science approaches to identify frequent flyers in heart failure: rise of the machines.
Andrew P Ambrosy ... Keane K Lee
European journal of heart failure | VOL. 21
Andrew P Ambrosy, et. al.Andrew P Ambrosy ... Keane K Lee
07 Feb 2019
European journal of heart failure | VOL. 21

Identifying Predictors of Heart Failure Readmission in Patients From a Statutory Health Insurance Database: Retrospective Machine Learning Study (Preprint)
Rebecca T Levinson ... Andreas D Meid
-
Rebecca T Levinson, et. al.Rebecca T Levinson ... Andreas D Meid
29 Nov 2023
29 Nov 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neural networks versus Logistic regression for 30\u2009days all-cause readmission prediction

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports