Abstract

Introduction: Cardiac amyloidosis (CA) is a common cause of progressive heart failure. New therapies can improve outcomes but most CA patients remain undiagnosed and untreated. Machine learning models deployed on electronic health record (EHR) data may be able to find patients with undiagnosed CA. To date, most models have focused on identification of undiagnosed amyloid from uncensored data modalities (Fig 1). Hypothesis: We hypothesized that lack of post-diagnosis censoring when training CA models leads to poor performance in predicting patients with undiagnosed CA whereas training with appropriate time censoring improves performance. Methods: We used 41 EHR features (demographics, labs, electrocardiogram/echocardiography measurements, vitals) to train a boosted decision tree model with and without time censoring. This was applied to 112 patients with confirmed CA and 22,400 controls matched on age, sex, encounter frequency and timespan of EHR. We also compared our findings to a web-based CA algorithm that was publicly available in 2020. Results: The EHR algorithm had modestly higher performance on at-risk, time-censored patients when trained with and without time censoring (area under the receiver operating characteristic curve (AUROC) 0.84±0.09 vs 0.79±0.07). Testing on temporally uncensored data showed higher performance (AUROC: 0.91±0.05) which may be unrepresentative of deployment scenarios where post-diagnostic features are unavailable for model use. The publicly available algorithm demonstrated a similar trend when tested on uncensored data (AUROC: 0.67±0.03) as compared to an appropriately censored feature set (AUROC: 0.54±0.04). Conclusions: EHR algorithms can be trained to find patients with high risk of undiagnosed cardiac amyloidosis. These models should be evaluated on temporally censored data so that post-diagnostic features do not artificially inflate performance estimates and negatively impact real-world deployment.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call