BackgroundThe lack of biomarkers to inform antidepressant selection is a key challenge in personalized depression treatment. This work identifies candidate biomarkers by building deep learning predictors of individual treatment outcomes using reward processing measures from functional magnetic resonance imaging, clinical assessments, and demographics. MethodsParticipants in the EMBARC (Establishing Moderators and Biosignatures of Antidepressant Response in Clinical Care) study (n = 222) underwent reward processing task-based functional magnetic resonance imaging at baseline and were randomized to 8 weeks of sertraline (n = 106) or placebo (n = 116). Subsequently, sertraline nonresponders (n = 37) switched to 8 weeks of bupropion. The change in Hamilton Depression Rating Scale was measured after treatment. Reward processing, clinical measurements, and demographics were used to train treatment-specific deep learning models. ResultsThe predictive model for sertraline achieved R2 of 48% (95% CI, 33%–61%; p < 10−3) in predicting the change in Hamilton Depression Rating Scale and number-needed-to-treat (NNT) of 4.86 participants in predicting response. The placebo model achieved R2 of 28% (95% CI, 15%–42%; p < 10−3) and NNT of 2.95 in predicting response. The bupropion model achieved R2 of 34% (95% CI, 10%–59%, p < 10−3) and NNT of 1.68 in predicting response. Brain regions where reward processing activity was predictive included the prefrontal cortex and cerebellar crus 1 for sertraline and the cingulate cortex, caudate, orbitofrontal cortex, and crus 1 for bupropion. ConclusionsThese findings demonstrate the utility of reward processing measurements and deep learning to predict antidepressant outcomes and to form multimodal treatment biomarkers.