Abstract

Electronic Health Record (EHR) data can be a key resource for decision-making support in clinical practice in the “big data” era. The complete database from early 2012 to late 2015 involving hospital admissions to Inselspital Bern, the largest Swiss University Hospital, was used in this study, involving over 100,000 admissions. Age, sex, and initial laboratory test results were the features/variables of interest for each admission, the outcome being inpatient mortality. Computational decision support systems were utilized for the calculation of the risk of inpatient mortality. We assessed the recently proposed Acute Laboratory Risk of Mortality Score (ALaRMS) model, and further built generalized linear models, generalized estimating equations, artificial neural networks, and decision tree systems for the predictive modeling of the risk of inpatient mortality. The Area Under the ROC Curve (AUC) for ALaRMS marginally corresponded to the anticipated accuracy (AUC = 0.858). Penalized logistic regression methodology provided a better result (AUC = 0.872). Decision tree and neural network-based methodology provided even higher predictive performance (up to AUC = 0.912 and 0.906, respectively). Additionally, decision tree-based methods can efficiently handle Electronic Health Record (EHR) data that have a significant amount of missing records (in up to >50% of the studied features) eliminating the need for imputation in order to have complete data. In conclusion, we show that statistical learning methodology can provide superior predictive performance in comparison to existing methods and can also be production ready. Statistical modeling procedures provided unbiased, well-calibrated models that can be efficient decision support tools for predicting inpatient mortality and assigning preventive measures.

Highlights

  • The use of Electronic Health Records (EHR) for building mortality predictive models is a modern practice that is expected to enhance patient care by pointing physicians to patients at risk, that would potentially be missed in clinical routine [1]

  • We show that models that are constructed based on statistical learning methodology are well-calibrated and offer superior diagnostic accuracy to previous or traditional regression approaches

  • We demonstrated that statistical learning methodology could provide superior predictive performance in comparison to existing methods given that estimated ROC Area Under the ROC Curve (AUC) of the models

Read more

Summary

Methods

We present the experiment design which involves the extraction of the database and methodology used for modeling purposes. A total of 23 numeric laboratory test results were included for reasons of consistency with the construction of the ALaRMS model. Those were serum chemistry (: albumin, aspartate transaminase, alkaline phosphatase, blood urea nitrogen (BUN), calcium, creatinine, glucose, potassium (K), sodium (Na), and total bilirubin); hematology and coagulation parameters (bands, hemoglobin, partial thromboplastin time, prothrombin time international normalized ratio (PT INR), platelets, and white blood cell count (WBC)); arterial blood gas (partial pressure of carbon dioxide (pCO2), partial pressure of oxygen (pO2), and pH value); cardiac markers (brain natriuretic peptide (BNP) or NT-proBNP, creatine phosphokinase MB (CK MB), and troponin T (to replace troponin I in the score). An ethics dispensation from the cantonal ethics committee Bern (NoZ023/2014) was issued for the anonymized use of these data

Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call