Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations.

Emily J Mackay, Nimesh D Desai, William J Hanson, Michael D Stubna, Corey Chivers, Michael E Draugelis, Peter W Groeneveld, Thippa Reddy Gadekallu

doi:10.1371/journal.pone.0252585

Abstract

Objective: This study aimed to develop and validate a claims-based, machine learning algorithm to predict clinical outcomes across both medical and surgical patient populations.

Methods: This retrospective, observational cohort study used a random 5% sample of 770,777 fee-for-service Medicare beneficiaries with an inpatient hospitalization between 2009–2011. The machine learning algorithms tested included support vector machine, random forest, multilayer perceptron, extreme gradient boosted tree, and logistic regression. The extreme gradient boosted tree algorithm outperformed the alternatives and was the machine learning method used for the final risk model. The primary outcome was 30-day mortality. Secondary outcomes were rehospitalization and any of 23 adverse clinical events occurring within 30 days of the index admission date.

Results: The machine learning algorithm performance was evaluated by both the area under the receiver operating characteristic curve (AUROC) and Brier Score. The risk model demonstrated high performance for prediction of 30-day mortality (AUROC = 0.88; Brier Score = 0.06) and 17 of the 23 adverse events (AUROC range: 0.80–0.86; Brier Score range: 0.01–0.05). The risk model demonstrated moderate performance for prediction of rehospitalization within 30 days (AUROC = 0.73; Brier Score = 0.07) and six of the 23 adverse events (AUROC range: 0.74–0.79; Brier Score range: 0.01–0.02). The machine learning risk model performed comparably on a second, independent validation dataset, confirming that the risk model was not overfit.

Conclusions and relevance: We have developed and validated a robust, claims-based, machine learning risk model that is applicable to both medical and surgical patient populations and demonstrates comparable predictive accuracy to existing risk models.
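The two evaluation metrics named in the Results can be illustrated with a minimal sketch. It is not the authors' code: the function names `auroc` and `brier_score` and the toy labels and probabilities are hypothetical, chosen only to show how each metric is computed from binary outcomes and predicted risks. AUROC is computed here by its pairwise definition (the fraction of positive/negative pairs in which the positive case receives the higher predicted risk, with ties counted as one half), and the Brier Score as the mean squared difference between predicted risk and the observed 0/1 outcome.

```python
from typing import Sequence


def brier_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Mean squared difference between predicted risk and the 0/1 outcome.

    Lower is better; 0.0 means every prediction matched its outcome exactly.
    """
    if len(y_true) != len(y_prob) or len(y_true) == 0:
        raise ValueError("y_true and y_prob must be non-empty and equal in length")
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auroc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Area under the ROC curve via its pairwise (Mann-Whitney) definition.

    Returns the fraction of (positive, negative) pairs in which the positive
    case has the higher predicted risk; tied pairs count as 0.5.
    This O(n_pos * n_neg) loop is for illustration, not for large cohorts.
    """
    if len(y_true) != len(y_prob):
        raise ValueError("y_true and y_prob must be equal in length")
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Hypothetical toy data: 1 = event occurred within 30 days, 0 = no event.
    outcomes = [0, 0, 1, 0, 1, 1, 0, 0]
    predicted_risk = [0.10, 0.20, 0.70, 0.40, 0.35, 0.90, 0.05, 0.60]
    print(f"AUROC: {auroc(outcomes, predicted_risk):.3f}")
    print(f"Brier Score: {brier_score(outcomes, predicted_risk):.3f}")
```

The two metrics answer different questions: AUROC measures discrimination (how well the model ranks patients by risk), whereas the Brier Score also reflects calibration. The Brier Score additionally depends on how common the outcome is, which is worth keeping in mind when reading low values reported for rare events.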

Highlights

Estimating risk is critical for decision making in both surgical and medical patient populations [1]
We have developed and validated a robust, claims-based, machine learning risk model that is applicable to both medical and surgical patient populations and demonstrates comparable predictive accuracy to existing risk models
Existing risk models cannot rapidly provide accurate information regarding patient risk [2,3,4], apply only to certain subsets of patient populations [2,3,4,5,6,7,8], and become outdated quickly because they are not built to be continuously updated with new data [2,3,4,5,6,7,8]

Summary

Introduction

Estimating risk is critical for decision making in both surgical and medical patient populations [1]. While logistic regression risk models exist for both medical [5,6,7,8] and surgical populations [2,3,4], these risk models require time-consuming, manual data entry [2,3,4] and apply only to limited subsets of patients. Robust machine learning (ML) algorithms for prognostication have been developed primarily using electronic medical record (EMR) data [16,17,18]. Such models' reliance on EMR data, which is often proprietary and unique to a particular health system, makes implementing these prognostic tools across multiple health systems costly and challenging.

Journal: PLOS ONE	Publication Date: Jun 3, 2021
Citations: 17	License type: CC BY 4.0

Similar Papers

Artificial Intelligence-related Literature in Transplantation: A Practical Guide.
Sook Hyeon Park ... Daniela P Ladner
Transplantation | VOL. 105
18 Aug 2020

Preoperative Surgical Risk Predictions Are Not Meaningfully Improved by Including the Surgical Apgar Score
Maxim A Terekhov ... Jesse M Ehrenfeld
Survey of Anesthesiology | VOL. 60
01 Jun 2016

Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population.
Jongseok Lee ... Chulho Kim
Journal of Personalized Medicine | VOL. 10
20 Aug 2020

Increased Level of Interleukin 6 Associates With Increased 90-Day and 1-Year Mortality in Patients With End-Stage Liver Disease
Johannes Remmler ... Thorsten Kaiser
Clinical Gastroenterology and Hepatology | VOL. 16
14 Sep 2017
