Artificial intelligence approaches for phenotyping heart failure in U.S. Veterans Health Administration electronic health record.

Yijun Shao, Yan Cheng, Sijian Zhang, Wen‐Chih Wu, Samir S Patel, Anshul Parulkar, Paul A Heidenreich, Hans Moore, Venkatesh K Raman, Qing Zeng‐Treitler, Gregg C Fonarow, Ali Ahmed, Helen M Sheriff, Phillip H Lam

doi:10.1002/ehf2.14787

Abstract

Heart failure (HF) is a clinical syndrome with no definitive diagnostic tests. HF registries are often based on manual reviews of medical records of hospitalized HF patients identified using International Classification of Diseases (ICD) codes. However, most HF patients are not hospitalized, and manual review of big electronic health record (EHR) data is not practical. The US Department of Veterans Affairs (VA) has the largest integrated healthcare system in the nation, and an estimated 1.5 million patients have ICD codes for HF (HF ICD-code universe) in their VA EHR. The objective of our study was to develop artificial intelligence (AI) models to phenotype HF in these patients.

The model development cohort (n = 20,000: training, 16,000; validation, 2,000; testing, 2,000) included 10,000 patients with HF and 10,000 without HF who were matched by age, sex, race, inpatient/outpatient status, hospital, and encounter date (within 60 days). HF status was ascertained by manual chart reviews in VA's External Peer Review Program for HF (EPRP-HF), and non-HF status was ascertained by the absence of ICD codes for HF in VA EHR. Two clinicians annotated 1,000 random snippets with HF-related keywords and labelled 436 as HF; these annotations were then used to train and test a natural language processing (NLP) model to classify HF (positive predictive value or PPV, 0.81; sensitivity, 0.77). A machine learning (ML) model using a linear support vector machine architecture was trained and tested to classify HF using EPRP-HF as cases (PPV, 0.86; sensitivity, 0.86). From the 'HF ICD-code universe', we randomly selected 200 patients (gold standard cohort), and two clinicians manually adjudicated HF (gold standard HF) in 145 of those patients by chart reviews. We calculated NLP, ML, and NLP+ML scores and used weighted F scores to derive their optimal threshold values for HF classification, which resulted in PPVs of 0.83, 0.77, and 0.85 and sensitivities of 0.86, 0.88, and 0.83, respectively. HF patients classified by the NLP+ML model were similar in characteristics and prognosis to those with gold standard HF. All three models achieved a better balance of PPV and sensitivity than ICD code approaches: one principal hospital discharge diagnosis code for HF (PPV, 0.97; sensitivity, 0.21) or two primary outpatient encounter diagnosis codes for HF (PPV, 0.88; sensitivity, 0.54).

These findings suggest that NLP and ML models are efficient AI tools to phenotype HF in big EHR data to create contemporary HF registries for clinical studies of effectiveness, quality improvement, and hypothesis generation.
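The threshold selection described in the abstract can be illustrated with a minimal sketch. It assumes each patient has a single continuous model score and a binary adjudicated label, and that the cut-off is chosen by scanning candidate thresholds for the highest weighted F score. The function names, the toy scores, and the choice of beta = 1 are all hypothetical; the abstract does not state the weights or the search procedure the authors used.

```python
from typing import List, Tuple


def confusion_counts(scores: List[float], labels: List[int],
                     threshold: float) -> Tuple[int, int, int]:
    """Return (true positives, false positives, false negatives) at a cut-off."""
    tp = fp = fn = 0
    for score, label in zip(scores, labels):
        predicted_hf = score >= threshold
        if predicted_hf and label == 1:
            tp += 1
        elif predicted_hf and label == 0:
            fp += 1
        elif not predicted_hf and label == 1:
            fn += 1
    return tp, fp, fn


def ppv_and_sensitivity(tp: int, fp: int, fn: int) -> Tuple[float, float]:
    """PPV = TP / (TP + FP); sensitivity = TP / (TP + FN)."""
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    return ppv, sensitivity


def f_beta(ppv: float, sensitivity: float, beta: float = 1.0) -> float:
    """Weighted F score; beta > 1 favours sensitivity, beta < 1 favours PPV."""
    denom = beta ** 2 * ppv + sensitivity
    return (1 + beta ** 2) * ppv * sensitivity / denom if denom else 0.0


def best_threshold(scores: List[float], labels: List[int],
                   beta: float = 1.0) -> Tuple[float, float]:
    """Try every observed score as a cut-off and keep the one with the best F score."""
    best = (0.0, -1.0)  # (threshold, F score)
    for t in sorted(set(scores)):
        ppv, sens = ppv_and_sensitivity(*confusion_counts(scores, labels, t))
        f = f_beta(ppv, sens, beta)
        if f > best[1]:
            best = (t, f)
    return best


# Toy, made-up scores and adjudicated labels (1 = HF, 0 = not HF); not study data.
toy_scores = [0.95, 0.90, 0.80, 0.70, 0.60, 0.40, 0.30, 0.20]
toy_labels = [1, 1, 1, 0, 1, 0, 0, 0]
threshold, f_score = best_threshold(toy_scores, toy_labels, beta=1.0)
print(f"chosen threshold={threshold}, F score={f_score:.3f}")
```

Under the same equal-weighting assumption, `f_beta` applied to the reported figures gives roughly 0.84 for the NLP+ML model (PPV 0.85, sensitivity 0.83) and roughly 0.35 for the single inpatient ICD code approach (PPV 0.97, sensitivity 0.21), which shows how a very high PPV can still yield a low combined score when sensitivity is poor.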

Journal: ESC heart failure	Publication Date: Jun 14, 2024
License type: CC BY-NC-ND 4.0

