Early detection of Alzheimer’s disease (AD) is crucial to maximize clinical outcomes. Most disease progression analyses include people with diagnoses of cognitive impairment, limiting understanding of AD risk among those with normal cognition. The objective was to establish AD progression models through a deep learning approach to analyze heterogeneous, multi-modal datasets, including clustering analyses of population subsets. A multi-head deep-learning architecture was built to process and learn from biomedical and imaging data from the National Alzheimer’s Coordinating Center. Shapley additive explanation algorithms for feature importance ranking and pairwise correlation analysis were used to identify predictors of disease progression. Four primary disease progression clusters (slow, moderate and rapid converters or non-converters) were subdivided into groups by race and sex, yielding 16 sub-clusters of participants with distinct progression patterns. A multi-head and early-fusion convolutional neural network achieved the most competitive performance and demonstrated superiority over a single-head deep learning architecture and conventional tree-based machine-learning methods, with 97% test accuracy, 96% F1 score and 0.19 root mean square error. From 447 features, 2 sets of 100 predictors of disease progression were extracted. Feature importance ranking, correlation analysis and descriptive statistics further enriched cluster analysis and validation of the heterogeneity of risk factors.