AbstractBackgroundEarly detection and diagnosis are critical for effective treatment of Alzheimer’s disease (AD). Statistical machine learning holds promise for disease prediction using large AD repositories, such as the National Alzheimer’s Coordinating Center (NACC) dataset. Past work has extensively explored binary classifications of AD diagnosis. Here, we extended this to address two major, remaining challenges: data imputation and multiclass classification (e.g., AD vs Mild Cognitive Impairment, MCI vs. Healthy Cognition, HC).MethodFor imputation, we tested single‐value (baseline), k‐NN, and Bayesian imputation on a subset of complete training data with artificially removed values (completely at random). Per modality of interest, a systematic search was conducted for the best input feature set to impute the missing values. The mean absolute deviation and mismatch rate between the imputed and true values were used to evaluate the methods on a held‐out test set. For classification, we then implemented both “flat” multiclass and hierarchical classification, an extension of multiclass classification that organizes classes into a tree, to predict clinical diagnosis using multimodal inputs. Several pipelines with combinations of imputation methods, classifiers, and input modalities were tested on different hierarchical classification strategies, e.g., HC vs [AD vs MCI], compared with the flat, multiclass baseline (AD vs MCI vs HC).ResultCompared to the baseline methods for both imputation and classification, the proposed alternatives performed better, as measured by the predetermined imputation and classification metrics evaluated on the test set. For imputation, behavioral and neuropsychiatric features were more easily and accurately imputed while ApoE genotype was more difficult to impute. For classification, using a combination of classifiers and input features in the hierarchy outperformed the baseline method and support the notion that select biomarkers are better able to characterize AD subpopulations within the hierarchy.ConclusionEstablishing a systematic imputation method tailored to AD biomarker modalities fills a critical gap in achieving more robust disease prediction. Building a hierarchical classifier parallels working through a differential diagnosis for AD, which offers greater translational value than binary classification of AD. In combination, these results form an important step toward developing reliable, holistic classifiers of AD disease status.