Abstract WP331: Identifying Stroke Subtypes with High Accuracy Using Machine Learning and Icd-9 Claims Data

Charles Esenwa,Hooman Kamel,Jorge Luna,Benjamin Kummer,Hojjat Salmasian,David Vawdrey,Mitchell Elkind

doi:10.1161/str.48.suppl_1.wp331

Abstract

Introduction: Stroke research using widely available institutional, state-wide and national retrospective data is dependent on accurate identification of stroke subtypes using claims data. Despite the abundance of such data and the advances in clinical informatics, there is limited published data on the application of machine learning models to improve previously reported administrative stroke identification algorithms. Hypothesis: We hypothesized that machine learning models can be applied to claims data coded using the International Classification of Disease, version 9 (ICD-9), to accuracy identify patients with ischemic stroke (IS), intracerebral hemorrhage (ICH), and subarachnoid hemorrhage (SAH), and these models would outperform previously published algorithms in our patient cohort. Methods: We developed a gold standard list of 427 stroke patients continuously admitted to our institution from 1/1/2015 to 9/30/2015 using an internal stroke database and applied 75% of it to train and 25% to test two machine learning models: one using classification and regression tree (CART) and another using regularized logistic regression. There were 2,241 negative controls. We further applied a previously reported stroke detection algorithm, by Tirschwell and Longstreth, to our cohort for comparison. Results: The CART model had a κ of 0.72, 0.82, 0.59; sensitivity of 95%, 99%, 99%; and a specificity of 88%, 78%, 75%; for IS, ICH and SAH respectively. The regularized logistic regression model had a κ of 0.73, 0.80, 0.59; sensitivity of 95%, 99%, 99%, and a specificity of 89%, 78%, 75%; for IS, ICH and SAH respectively. The previously reported algorithm by Tirschwell et al, had a κ of 0.71,0.56, 0.64; sensitivity of 98%, 99%, 99%; and a specificity of 64%, 52%, 50%; for IS, ICH and SAH. Conclusion: Compared with the previously reported ICD 9 based detection algorithm, the machine learning models had a higher κ for diagnosis of IS and ICH, similar sensitivity for all subtypes, and higher specificity for all stroke subtypes in our cohort. Applying machine learning models to identify stroke subtypes from administrative data sets, can lead to highly accurate models of stroke subtype identification for health services researchers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Abstract WP331: Identifying Stroke Subtypes with High Accuracy Using Machine Learning and Icd-9 Claims Data

Abstract

Talk to us

Similar Papers

More From: Stroke

Lead the way for us

Similar Papers

Abstract WP312: Identifying Acute Ischemic Stroke by Analyzing Icd-10 Claims Data Using Machine Learning Models
Charles Esenwa ... David Vawdrey
Stroke | VOL. 48
Charles Esenwa, et. al.Charles Esenwa ... David Vawdrey
01 Feb 2017
Abstract WP312: Identifying Acute Ischemic Stroke by Analyzing Icd-10 Claims Data Using Machine Learning Models
Charles Esenwa ... David Vawdrey

Prevalence, risk factors and prognostic value of atrial fibrillation detected after stroke after haemorrhagic versus ischaemic stroke
Jiahuan Guo ... Xingquan Zhao
Stroke and Vascular Neurology | VOL. -
Jiahuan Guo, et. al.Jiahuan Guo ... Xingquan Zhao
16 Feb 2024
Stroke and Vascular Neurology | VOL. -

Response
D Gaist
Stroke | VOL. -
D GaistD Gaist
15 May 2003
Stroke | VOL. -

P3718Identification of nine genes as novel susceptibility loci for early-onset ischemic stroke, intracerebral hemorrhage, or subarachnoid hemorrhage
Y Yamase ... J Sakuma
European Heart Journal | VOL. 40
Y Yamase, et. al.Y Yamase ... J Sakuma
01 Oct 2019
European Heart Journal | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Abstract WP331: Identifying Stroke Subtypes with High Accuracy Using Machine Learning and Icd-9 Claims Data

Abstract

Talk to us

Similar Papers

More From: Stroke