ObjectivesAccuracy in the detection of recurrent nasopharyngeal carcinoma (NPC) on follow-up magnetic resonance (MR) scans needs to be improved. Material and methodsA total of 5 035 follow-up MR scans from 5 035 survivors with treated NPC between April 2007 and July 2020 were retrospectively collected from three cancer centers for developing and evaluating the deep learning (DL) model MODERN (MR-based Deep learning model for dEtecting Recurrent Nasopharyngeal carcinoma). In a reader study with 220 scans, the accuracy of two radiologists in detecting recurrence on scans with vs without MODERN was evaluated. The performance was measured using the area under the receiver operating characteristic curve (ROC-AUC) and accuracy with a 95% confidence interval (CI). ResultsMODERN exhibited sound performance in the validation cohort (internal: ROC-AUC, 0.88, 95% CI, 0.86–0.90; external 1: ROC-AUC, 0.88, 95% CI, 0.86–0.90; external 2: ROC-AUC, 0.85, 95% CI, 0.82–0.88). In a reader study, MODERN alone achieved reliable accuracy compared to that of radiologists (MODERN: 84.1%, 95% CI, 79.3%–88.9%; competent: 80.9%, 95% CI, 75.7%–86.1%, P < 0.001; expert: 85.9%, 95% CI, 81.3%–90.5%, P < 0.001). The accuracy of radiologists was boosted by the MODERN score (competent with MODERN score: 84.6%, 95% CI, 79.8%–89.3%, P < 0.001; expert with MODERN score: 87.7%, 95% CI, 83.4%–92.1%, P < 0.001). ConclusionWe developed a DL model for recurrence detection with reliable performance. Computer-human collaboration has the potential to refine the workflow in interpreting surveillant MR scans among patients with treated NPC.