There has been a proliferation of machine learning (ML) electrocardiogram (ECG) classification algorithms reaching>85% accuracy for various cardiac pathologies. Although the accuracy within institutions might be high, models trained at one institution might not be generalizable enough for accurate detection when deployed in other institutions due to differences in type of signal acquisition, sampling frequency, time of acquisition, device noise characteristics and number of leads. In this proof-of-concept study, we leverage the publicly available PTB-XL dataset to investigate the use of time-domain (TD) and frequency-domain (FD) convolutional neural networks (CNN) to detect myocardial infarction (MI), ST/T-wave changes (STTC), atrial fibrillation (AFIB) and sinus arrhythmia (SARRH). To simulate interinstitutional deployment, the TD and FD implementations were also compared on adapted test sets using different sampling frequencies 50Hz, 100Hz and 250Hz, and acquisition times of 5s and 10s at 100Hz sampling frequency from the training dataset. When tested on the original sampling frequency and duration, the FD approach showed comparable results to TD for MI (0.92 FD - 0.93 TD AUROC) and STTC (0.94 FD - 0.95 TD AUROC), and better performance for AFIB (0.99 FD - 0.86 TD AUROC) and SARRH (0.91 FD - 0.65 TD AUROC). Although both methods were robust to changes in sampling frequency, changes in acquisition time were detrimental to the TD MI and STTC AUROCs, at 0.72 and 0.58 respectively. Alternatively, the FD approach was able to maintain the same level of performance, and, therefore, showed better potential for interinstitutional deployment.
Read full abstract