Random Survival Forest Research Articles

Abstract Background Measuring heart structure and function drives much cardiac decision making and therapy. Precise quantification is therefore of high importance, but manual (clinician) segmentation introduces measurement variability. Purpose Automated segmentation of cardiac MRI (CMR) left ventricular (LV) volumes provides improved precision but how this translates into clinical outcomes has yet to be defined. In order to test predictive accuracy in a wide range of adverse LV morphologies, we applied AI LV segmentation to patients with coronary disease or cardiomyopathy. Methods Scans were analysed from a retrospective clinical CMR database and mortality data was obtained from a national database. Clinician measurement of indexed LV end-diastolic volume (LVEDVi), LV mass (LVMi), myocardial contraction fraction (MCF: ratio of stroke volume to myocardial volume; a measure of myocardial efficiency) and EF were recorded from clinical reports (all senior clinicians with &gt;5 years level 3 experience) and compared to AI measurement using a segmentation algorithm. This has previously been shown to exceed human precision and required no manual correction (1,2). Random survival forests were built to identify overall predictive value of AI vs clinician derived LV volumes. These are machine-learning algorithms for predicting time-to-event outcomes and can capture complex relationships between predictors and outcome. Random survival forests were built from AI and clinician models with EF, EDVi, LVMi and MCF as predictors for all-cause mortality. A train:test (80:20) split, stratified for outcome was employed. Concordance (C)-indices (measuring predictive accuracy) and permutation importances (measuring significance of each predictor within the model) (Table) with 1000 bootstrapped 95% confidence intervals and p-values were derived. Results 8,299 patients were analysed. Patients were 60(51-70) years old and 37% female. 49% had coronary disease, 25% had DCM and 19% had HCM. At a median follow-up of 5.3 years (IQR 4.2-6.6 years) there were 1,384 deaths (17%). AI derived LV-parameter models had higher predictive accuracy than clinician derived models (C-Indices: 0.66±0.002 vs 0.65±0.005, p&lt;0.001). A subgroup analysis of patients undergoing ischaemia testing for coronary disease was performed (n=3,560) and markers of ischaemia (positive vs negative) and infarction (non-viable vs viable) were included in the models. AI had a higher predictive accuracy compared to clinicians when including these additional predictors (C-Indices: 0.68±0.03 vs 0.63±0.01, p&lt;0.001). Conclusion AI derived LV volumes outperforms clinicians in prediction of all-cause mortality reflecting improved precision. Improvements in predictive accuracy are likely to be clinically important when applied to populations and high-risk groups. Manual clinician segmentation should be superseded by clinician-supervised AI.

Read full abstract

Abstract Background Machine learning (ML) models provide potential advantage over ‘traditional’ regression models in heart failure (HF) prediction. Objective To compare performances of Cox PH models and ML survival models for incident HF in men and women without prevalent ischemic heart disease (IHD). We also aimed to identify potential high-risk precursors otherwise ignored by conventional survival models, and to investigate differences between sex-specific models. Methods We included 476,393 participants (55.6% women) from the UK Biobank, after excluding participants with a history of HF or IHD, and defined sex-specific datasets. We predicted incident HF events using over 400 baseline characteristics. We constructed multivariable Cox PH models, which included all predictor variables and subsequently only those remaining after LASSO stability selection. We also developed two supervised ML models (Random Survival Forest (RSF), eXtreme Gradient Survival Boosting (XGBoost)). We identified the 15 most important sex-specific predictors in each model and performances were compared using the C-index. Models were validated using hold-out sets. Results During 12.3 ± 1.9 years of follow-up, 4680 (1.76%) women and 6631 (3.14%) men developed incident HF. XGBoost showed the best performance during model training (C-index, training: 0.89 in men, 0.97 in women; validation 0.77 in men, 0.80 in women). The multivariable Cox model performed second-best (C-index, training: 0.78 in men, 0.82 in women; validation: 0.76 in men, 0.78 in women). RSF performed slightly worse (C-index, training: 0.75 in men, 0.79 in women; validation: 0.75 in men, 0.79 in women) but did not show performance drop during validation. LASSO stability selection performed similar to RSF. Age, self-reported lifetime treatments and medications, cystatin-C, waist circumference and FEV1-scores were identified as strong risk factors in all models for both sexes. Reduced albumin levels and elevated HbA1c were more strongly associated with high risk in men, while elevated systolic BP showed higher importance in women. Traditional Cox models observed CRP as important only in men, while the ML models identified CRP as important for both sexes. Neutrophil count was considered a strong risk factor in both sexes in the traditional Cox models, yet it was not among the most important predictors in both ML models. Presence of other heart disease (which included a.o. pericardial disease, valve disorders and arrhythmias) was an important predictor variable only in the ML models. Conclusion ML models showed similar performance to Cox PH models for HF prediction. Despite this, differences in predictor importance were identified between models. Sex-specific risk predictors were found, and FEV1 score, which is not commonly included in existing models, was identified as an important risk factor. These results suggest that ML models may reveal additional insights that would otherwise remain unnoticed.

Read full abstract

Random Survival Forest Research Articles

Related Topics

Articles published on Random Survival Forest

Using Machine Learning and Feature Importance to Identify Risk Factors for Mortality in Pediatric Heart Surgery

Transferability and comparability of sewer deterioration models – a case study on Norwegian sewer data

Comparison of multiple machine learning models for predicting prognosis of pancreatic ductal adenocarcinoma based on contrast-enhanced CT radiomics and clinical features

Abstract 4129397: Cardiovascular Risk Models Using Large-Scale Physical Examination Data

Abstract 4143270: PREDICTIVE MODELS AID PHYSICIAN PROGNOSTICATION: A SECONDARY ANALYSIS EVALUATING INTEGRATED MODEL AND PHYSICIAN PROGNOSTIC ESTIMATES IN PATIENTS WITH HEART FAILURE WITH REDUCED EJECTION FRACTION

SPINK5 is a key regulator of eosinophil extracellular traps in head and neck squamous cell carcinoma

Liquid-Liquid Phase Separation in the Prognosis of Lung Adenocarcinoma: An Integrated Analysis.

Random survival forest algorithm for risk stratification and survival prediction in gastric neuroendocrine neoplasms.

Prediction of recurrence-free survival and risk factors of sinonasal inverted papilloma after surgery by machine learning models

Balancing accuracy and Interpretability: An R package assessing complex relationships beyond the Cox model and applications to clinical prediction

Delta-radiomics Approach Using Contrast-enhanced and Non-contrast-enhanced Computed Tomography Images for Predicting Distant Metastasis in Patients with Borderline Resectable Pancreatic Carcinoma

Integrated multi-level omics profiling of disulfidptosis identifis SPAG4 as an innovative immunotherapeutic target in glioblastoma.

Predicting survival benefits of immune checkpoint inhibitor therapy in lung cancer patients: a machine learning approach using real-world data.

AI derived myocardial left ventricular volumes outperforms clinician segmentation for prediction of all-cause mortality

Comprehensive machine learning models for prediction of heart failure in 476,393 women and men from the UK Biobank reveal sex differences and underutilized risk factors

Improving prognostic accuracy in ischemic cardiomyopathy: the CMR-LGE score

Multi-modal artificial intelligence-based prediction for cardiovascular mortality after transcatheter aortic valve implantation: a time-to-event survival prediction

Prediction model for survival of younger patients with breast cancer using the breast cancer public staging database

Detection and risk stratification of cardiac amyloidosis patients by integration of imaging and non-imaging data using a machine learning approach

Left atrial function for predicting atrial fibrillation: a machine learning approach

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Random Survival Forest Research Articles

Related Topics

Articles published on Random Survival Forest

Using Machine Learning and Feature Importance to Identify Risk Factors for Mortality in Pediatric Heart Surgery

Transferability and comparability of sewer deterioration models – a case study on Norwegian sewer data

Comparison of multiple machine learning models for predicting prognosis of pancreatic ductal adenocarcinoma based on contrast-enhanced CT radiomics and clinical features

Abstract 4129397: Cardiovascular Risk Models Using Large-Scale Physical Examination Data

Abstract 4143270: PREDICTIVE MODELS AID PHYSICIAN PROGNOSTICATION: A SECONDARY ANALYSIS EVALUATING INTEGRATED MODEL AND PHYSICIAN PROGNOSTIC ESTIMATES IN PATIENTS WITH HEART FAILURE WITH REDUCED EJECTION FRACTION

SPINK5 is a key regulator of eosinophil extracellular traps in head and neck squamous cell carcinoma

Liquid-Liquid Phase Separation in the Prognosis of Lung Adenocarcinoma: An Integrated Analysis.

Random survival forest algorithm for risk stratification and survival prediction in gastric neuroendocrine neoplasms.

Prediction of recurrence-free survival and risk factors of sinonasal inverted papilloma after surgery by machine learning models

Balancing accuracy and Interpretability: An R package assessing complex relationships beyond the Cox model and applications to clinical prediction

Delta-radiomics Approach Using Contrast-enhanced and Non-contrast-enhanced Computed Tomography Images for Predicting Distant Metastasis in Patients with Borderline Resectable Pancreatic Carcinoma

Integrated multi-level omics profiling of disulfidptosis identifis SPAG4 as an innovative immunotherapeutic target in glioblastoma.

Predicting survival benefits of immune checkpoint inhibitor therapy in lung cancer patients: a machine learning approach using real-world data.

AI derived myocardial left ventricular volumes outperforms clinician segmentation for prediction of all-cause mortality

Comprehensive machine learning models for prediction of heart failure in 476,393 women and men from the UK Biobank reveal sex differences and underutilized risk factors

Improving prognostic accuracy in ischemic cardiomyopathy: the CMR-LGE score

Multi-modal artificial intelligence-based prediction for cardiovascular mortality after transcatheter aortic valve implantation: a time-to-event survival prediction

Prediction model for survival of younger patients with breast cancer using the breast cancer public staging database

Detection and risk stratification of cardiac amyloidosis patients by integration of imaging and non-imaging data using a machine learning approach

Left atrial function for predicting atrial fibrillation: a machine learning approach