Does the SORG Machine-learning Algorithm for Extremity Metastases Generalize to a Contemporary Cohort of Patients? Temporal Validation From 2016 to 2020.

Tom M De Groot,Olivier Q Groot,Austin Keith Collins,Duncan Ramsey,Job N Doornberg,Marco Ferrone,Emily A Berner,Joseph H Schwab,Eric Newman,Peter K Twining,Kevin Raskin,Brian P Fenn,Santiago Lozano,Aditya V Karhade,Mitchell Fourman

doi:10.1097/corr.0000000000002698

Abstract

The ability to predict survival accurately in patients with osseous metastatic disease of the extremities is vital for patient counseling and guiding surgical intervention. We, the Skeletal Oncology Research Group (SORG), previously developed a machine-learning algorithm (MLA) based on data from 1999 to 2016 to predict 90-day and 1-year survival of surgically treated patients with extremity bone metastasis. As treatment regimens for oncology patients continue to evolve, this SORG MLA-driven probability calculator requires temporal reassessment of its accuracy. Does the SORG-MLA accurately predict 90-day and 1-year survival in patients who receive surgical treatment for a metastatic long-bone lesion in a more recent cohort of patients treated between 2016 and 2020? Between 2017 and 2021, we identified 674 patients 18 years and older through the ICD codes for secondary malignant neoplasm of bone and bone marrow and CPT codes for completed pathologic fractures or prophylactic treatment of an impending fracture. We excluded 40% (268 of 674) of patients, including 18% (118) who did not receive surgery; 11% (72) who had metastases in places other than the long bones of the extremities; 3% (23) who received treatment other than intramedullary nailing, endoprosthetic reconstruction, or dynamic hip screw; 3% (23) who underwent revision surgery, 3% (17) in whom there was no tumor, and 2% (15) who were lost to follow-up within 1 year. Temporal validation was performed using data on 406 patients treated surgically for bony metastatic disease of the extremities from 2016 to 2020 at the same two institutions where the MLA was developed. Variables used to predict survival in the SORG algorithm included perioperative laboratory values, tumor characteristics, and general demographics. To assess the models' discrimination, we computed the c-statistic, commonly referred to as the area under the receiver operating characteristic (AUC) curve for binary classification. This value ranged from 0.5 (representing chance-level performance) to 1.0 (indicating excellent discrimination) Generally, an AUC of 0.75 is considered high enough for use in clinical practice. To evaluate the agreement between predicted and observed outcomes, a calibration plot was used, and the calibration slope and intercept were calculated. Perfect calibration would result in a slope of 1 and intercept of 0. For overall performance, the Brier score and null-model Brier score were determined. The Brier score can range from 0 (representing perfect prediction) to 1 (indicating the poorest prediction). Proper interpretation of the Brier score necessitates a comparison with the null-model Brier score, which represents the score for an algorithm that predicts a probability equal to the population prevalence of the outcome for each patient. Finally, a decision curve analysis was conducted to compare the potential net benefit of the algorithm with other decision-support methods, such as treating all or none of the patients. Overall, 90-day and 1-year mortality were lower in the temporal validation cohort than in the development cohort (90 day: 23% versus 28%; p < 0.001, and 1 year: 51% versus 59%; p<0.001). Overall survival of the patients in the validation cohort improved from 28% mortality at the 90-day timepoint in the cohort on which the model was trained to 23%, and 59% mortality at the 1-year timepoint to 51%. The AUC was 0.78 (95% CI 0.72 to 0.82) for 90-day survival and 0.75 (95% CI 0.70 to 0.79) for 1-year survival, indicating the model could distinguish the two outcomes reasonably. For the 90-day model, the calibration slope was 0.71 (95% CI 0.53 to 0.89), and the intercept was -0.66 (95% CI -0.94 to -0.39), suggesting the predicted risks were overly extreme, and that in general, the risk of the observed outcome was overestimated. For the 1-year model, the calibration slope was 0.73 (95% CI 0.56 to 0.91) and the intercept was -0.67 (95% CI -0.90 to -0.43). With respect to overall performance, the model's Brier scores for the 90-day and 1-year models were 0.16 and 0.22. These scores were higher than the Brier scores of internal validation of the development study (0.13 and 0.14) models, indicating the models' performance has declined over time. The SORG MLA to predict survival after surgical treatment of extremity metastatic disease showed decreased performance on temporal validation. Moreover, in patients undergoing innovative immunotherapy, the possibility of mortality risk was overestimated in varying severity. Clinicians should be aware of this overestimation and discount the prediction of the SORG MLA according to their own experience with this patient population. Generally, these results show that temporal reassessment of these MLA-driven probability calculators is of paramount importance because the predictive performance may decline over time as treatment regimens evolve. The SORG-MLA is available as a freely accessible internet application at https://sorg-apps.shinyapps.io/extremitymetssurvival/ .Level of Evidence Level III, prognostic study.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Clinical Orthopaedics & Related Research	Publication Date: May 25, 2023
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Does the SORG Machine-learning Algorithm for Extremity Metastases Generalize to a Contemporary Cohort of Patients? Temporal Validation From 2016 to 2020.

Abstract

Talk to us

Similar Papers

More From: Clinical Orthopaedics & Related Research

Lead the way for us

Similar Papers

International Validation of the SORG Machine-learning Algorithm for Predicting the Survival of Patients with Extremity Metastases Undergoing Surgical Treatment.
Ting-En Tseng ... Rong-Sen Yang
Clinical Orthopaedics & Related Research | VOL. 480
Ting-En Tseng, et. al.Ting-En Tseng ... Rong-Sen Yang
07 Sep 2021
Clinical Orthopaedics & Related Research | VOL. 480

How Does the Skeletal Oncology Research Group Algorithm's Prediction of 5-year Survival in Patients with Chondrosarcoma Perform on International Validation?
Michiel E R Bongers ... Kivilcim E Erdoğan
Clinical orthopaedics and related research | VOL. 478
Michiel E R Bongers, et. al.Michiel E R Bongers ... Kivilcim E Erdoğan
18 May 2020
Clinical orthopaedics and related research | VOL. 478

External validation of the SORG machine learning algorithms for predicting 90-day and 1-year survival of patients with lung cancer-derived spine metastases: a recent bi-center cohort from China
Guoqing Zhong ... Yu Zhang
The Spine Journal | VOL. 23
Guoqing Zhong, et. al.Guoqing Zhong ... Yu Zhang
25 Jan 2023
The Spine Journal | VOL. 23

Does the SORG Algorithm Predict 5-year Survival in Patients with Chondrosarcoma? An External Validation.
Michiel E R Bongers ... Kevin A Raskin
Clinical Orthopaedics & Related Research | VOL. 477
Michiel E R Bongers, et. al.Michiel E R Bongers ... Kevin A Raskin
27 Apr 2019
Clinical Orthopaedics & Related Research | VOL. 477

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Does the SORG Machine-learning Algorithm for Extremity Metastases Generalize to a Contemporary Cohort of Patients? Temporal Validation From 2016 to 2020.

Abstract

Talk to us

Similar Papers

More From: Clinical Orthopaedics &amp; Related Research

More From: Clinical Orthopaedics & Related Research