Abstract Background Common risk scores for atrial fibrillation (AF) are based on clinical risk factors, and little is known about the contribution of left atrial (LA) function parameters to risk assessment. Purpose To determine whether a machine learning-based model that includes LA function parameters outperforms a commonly used risk scores, the CHARGE-AF, in predicting AF. Methods Participants in the Atherosclerosis Risk in Communities (ARIC) Study (a community-based cohort study in the USA) who attended visit 5 (2011-2013) and without a history of AF were included in this analysis. Incident AF was ascertained from ECG during study visits, hospitalization discharge codes, and death certificates. Random survival forest methodology was used to build a prediction model for AF; candidate variables included traditional clinical risk factors and measures of LA structure and function. Dataset was randomly split into a training (75%) and testing (25%) cohort. For each variable, importance was measured based on the reduction in Harrell’s c-statistic if the variable was randomly permuted prior to obtaining predicted values. Variables significantly associated with AF were included in the final model. Model calibration was tested using the Greenwood-Nam-D'Agostino (GND) test. Discrimination ability of our model was assessed in the testing cohort using Harrell’s c-statistic and compared with that of the CHARGE-AF model. Results 4,909 participants (59% female, 22% Black, mean age 75 years) were included. During a median follow-up of 7.0 years (1st-3rd quartile: 5.4-7.8), 650 participants developed AF. Variables significantly associated with incident AF were race, triglycerides, systolic blood pressure, coronary heart disease, age, LA maximal volume index, NT-proBNP, and LA reservoir strain (Figure 1). LA reservoir strain was the variable most strongly associated with AF. The final model showed good discrimination (c-statistic (95% CI): 0.734 (0.698-0.768)). The GND test did not detect statistically significant miscalibration (GND p=0.08). Compared with our model, CHARGE-AF had lower discrimination ability (c-statistic (95% CI): 0.635 (0.594-0.677)) and the difference was statistically significant (difference in c-statistic (95% CI): 0.099 (0.043-0.152)). Conclusion A machine learning-based model with few variables including LA function outperforms a commonly used risk model in predicting AF. Incorporation of LA functional assessment may help inform clinicians regarding individual risk of AF.