Introduction Mantle cell lymphoma (MCL) presents challenges due to its aggressive nature and dismal treatment outcomes. The Mantle Cell Lymphoma International Prognostic Index (MIPI) score is valuable for risk stratification, but it may not fully capture the complexity of MCL biology. To address this, we explored innovative data-driven methods based on machine learning to augment the MIPI score. By integrating clinical and histological data with advanced algorithms, this study aimed to enhance risk assessment in MCL. Improved prognostication of survival could facilitate the design of risk-tailored therapeutic approaches, directing more aggressive treatments to high-risk patients, ultimately leading to personalized and precise management. Methods The following patient-level variables were retrieved: age, gender, Ann-Arbor stage (I-II vs III-IV), Bulky disease, extranodal disease, histopathological variant (classic vs pleomorphic/blastoid lymphoma), and Ki67+ cells. The MIPI score was used to categorize patients into low, intermediate, and high-risk groups. Employing random survival forests, we developed a novel risk score by integrating clinical and histological data. Missing variables were imputed using a random forests algorithm. Harrel's concordance indexes (c-indexes) were used to evaluate model's accuracy. Results We assembled a real-world database of 231 MCL patients from 6 Spanish tertiary hospitals (Jan 2000-Dec 2021), and another cohort for validation from another Spanish tertiary hospital (N=44). Median age at diagnosis was 64 years in both cohorts. Median follow-up was 6.28 y and 8.67 y in the training and test sets, and median overall survival (OS) was 11.48 y and 5.61 y. According to the MIPI score, 44% and 38% of patients were of high risk in the training and validation sets; 26% and 39% were of intermediate risk, and 30% & 23% were of low risk. The c-index of the MIPI groups for the prediction of OS was 0.716 and 0.684 in the training and validation sets, respectively. Then, a random survival forest algorithm was used to derive a new risk score in the training set. The first model contained all the available variables mentioned previously. This 8-variable model achieved a cross-validated c-index of 0.764 in the training set. A variable reduction technique was implemented to remove those variables that provided less independent prognostic value. A simplified model was developed, which contained the following variables by order of importance: histological subtype, MIPI-derived groups, age at diagnosis and Ann-Arbor stage. This model outperformed the conventional MIPI grouping strategy, achieving c-indexes of 0.764 and 0.775 in the training and validation sets, respectively ( Figure 1). Conclusion This study showcases the potential of a novel machine learning model in MCL prognostication beyond the traditional MIPI score, utilizing easily accessible clinical information. The inclusion of histological subtype, MIPI score, age at diagnosis, and Ann-Arbor stage as robust independent prognostic factors improves risk prediction accuracy. Future research may further refine the model and explore additional variables to continue optimizing prognostication and treatment strategies for MCL patients.
Read full abstract