AbstractBackgroundMachine learning models can leverage historical data to forecast disease progression. These predictions can be integrated in clinical trial design to reduce sample size or increase power, speeding up the evaluation of new drugs. This is especially critical in AD where trials face challenges with enrollment and where no new therapies have emerged in 18 years. Recently, there has been a shift towards evaluating drugs in earlier stages of the disease using Clinical Dementia Rating Sum‐of‐Boxes (CDR‐SB) as the primary endpoint. We present a model that includes CDR‐SB and spans a broad range of baseline disease severity from MCI to mild‐to‐moderate AD.MethodWe used nearly 7,000 clinical records from placebo arms of AD clinical trials (in the C‐Path Online Data Repository for AD) and from observational studies (in the AD Neuroimaging Initiative) to train Conditional Restricted Boltzmann Machines (CRBMs). A CRBM is a generative machine learning model that learns a multivariate distribution over the relevant variables, and is particularly suited to model clinical data. A CRBM generates Digital Twins ‐ synthetic clinical records with baseline characteristics matched to those of actual trial subjects describing their likely progression under standard‐of‐care (SOC) with/without placebo. These forecasts include all components of CDR‐SB, Alzheimer’s Disease Assessment Scale ‐ Cognitive Subscale (ADAS‐Cog11) and Mini Mental State Examination (MMSE), along with other variables including labs, vitals, and biomarker status. We evaluated Digital Twins on a held‐out portion of the dataset (not included in the training dataset), stratified by baseline ADAS‐Cog11. For each variable, the mean change from baseline +/‐ 95% confidence interval error at 3‐month intervals was estimated by drawing 100 Digital Twins for each subject and averaging over the population.ResultCRBM‐generated predictions of ADAS‐Cog11, MMSE, and CDR‐SB progression up to 18 months were consistent within 95% confidence intervals with actual data across all scores, timepoints, and cohorts.ConclusionDigital Twins accurately model MCI and AD subjects’ progression on SOC with/without placebo across a broad range of baseline disease severity, and can be integrated into clinical trials as prognostic scores to increase power or reduce sample size required to achieve desired power.