Real-world performance, long-term efficacy, and absence of bias in the artificial intelligence enhanced electrocardiogram to detect left ventricular systolic dysfunction.

David M Harmon,Francisco Lopez-Jimenez,Anna Svatikova,Rickey E Carter,Demilade A Adedinsewo,Michal Cohen-Shelly,Zachi I Attia,Paul A Friedman,Peter A Noseworthy,Suraj Kapa

doi:10.1093/ehjdh/ztac028

David M Harmon, Francisco Lopez-Jimenez + Show 8 more

Open Access

https://doi.org/10.1093/ehjdh/ztac028

Copy DOI

Journal: European heart journal. Digital health	Publication Date: May 17, 2022
Citations: 10	License type: CC BY-NC 4.0

Affiliation: Mayo Clinic

Abstract

AimsSome artificial intelligence models applied in medical practice require ongoing retraining, introduce unintended racial bias, or have variable performance among different subgroups of patients. We assessed the real-world performance of the artificial intelligence-enhanced electrocardiogram to detect left ventricular systolic dysfunction with respect to multiple patient and electrocardiogram variables to determine the algorithm’s long-term efficacy and potential bias in the absence of retraining.Methods and resultsElectrocardiograms acquired in 2019 at Mayo Clinic in Minnesota, Arizona, and Florida with an echocardiogram performed within 14 days were analyzed (n = 44 986 unique patients). The area under the curve (AUC) was calculated to evaluate performance of the algorithm among age groups, racial and ethnic groups, patient encounter location, electrocardiogram features, and over time. The artificial intelligence-enhanced electrocardiogram to detect left ventricular systolic dysfunction had an AUC of 0.903 for the total cohort. Time series analysis of the model validated its temporal stability. Areas under the curve were similar for all racial and ethnic groups (0.90–0.92) with minimal performance difference between sexes. Patients with a ‘normal sinus rhythm’ electrocardiogram (n = 37 047) exhibited an AUC of 0.91. All other electrocardiogram features had areas under the curve between 0.79 and 0.91, with the lowest performance occurring in the left bundle branch block group (0.79).ConclusionThe artificial intelligence-enhanced electrocardiogram to detect left ventricular systolic dysfunction is stable over time in the absence of retraining and robust with respect to multiple variables including time, patient race, and electrocardiogram features.

Full Text