Abstract Background Many studies have utilized data sources such as clinical variables, polygenic risk scores, ECG, and plasma proteins to predict the risk of atrial fibrillation (AF). However, few studies have integrated all four sources from a single study to comprehensively assess AF prediction. Methods We included 8374 (Visit 3, 1993-1995) and 3,730 (Visit 5, 2011-2013) participants from the Atherosclerosis Risk in Communities Study to predict incident AF and prevalent (but covert) AF. We constructed a 1) clinical risk score using CHARGE-AF clinical variables, 2) polygenic risk score using pre-determined weights, 3) protein risk score using penalized and regularized logistic regression, and 4) ECG risk score from convolutional neural network. Risk prediction performance was measured using regularized logistic regression. Results After a median follow up of 15.1 years, 1910 AF events occurred since Visit 3 and 229 participants had prevalent AF at Visit 5. The AUC improved from 0.660 to 0.752 (95% CI, 0.741-0.763) and from 0.737 to 0.854 (95% CI, 0.828-0.880) after addition of polygenic risk score to the CHARGE-AF clinical variables for predicting incident and prevalent AF, respectively. Further addition of ECG and protein risk scores improved the AUC to 0.763 (95% CI, 0.753-0.772) and 0.875 (95% CI, 0.851-0.899) for predicting incident and prevalent AF, respectively. Conclusions A combination of clinical and polygenic risk scores was the most effective and parsimonious approach to predicting AF. Further addition of an ECG risk score or protein risk score provided only modest incremental improvement for predicting AF.
Read full abstract