Linear Cox Proportional Hazards Models Research Articles

Sparse survival models are statistical models that select a subset of predictor variables while modeling the time until an event occurs, which can subsequently help interpretability and transportability. The subset of important features is often obtained with regularized models, such as the Cox Proportional Hazards model with Lasso regularization, which limit the number of non-zero coefficients. However, such models can be sensitive to the choice of regularization hyperparameter. In this work, we develop a software package and demonstrate how knowledge distillation, a powerful technique in machine learning that aims to transfer knowledge from a complex teacher model to a simpler student model, can be leveraged to learn sparse survival models while mitigating this challenge. For this purpose, we present sparsesurv, a Python package that contains a set of teacher-student model pairs, including the semi-parametric accelerated failure time and the extended hazards models as teachers, which currently do not have Python implementations. It also contains in-house survival function estimators, removing the need for external packages. Sparsesurv is validated against R-based Elastic Net regularized linear Cox proportional hazards models as implemented in the commonly used glmnet package. Our results reveal that knowledge distillation-based approaches achieve competitive discriminative performance relative to glmnet across the regularization path while making the choice of the regularization hyperparameter significantly easier. All of these features, combined with a sklearn-like API, make sparsesurv an easy-to-use Python package that enables survival analysis for high-dimensional datasets through fitting sparse survival models via knowledge distillation. sparsesurv is freely available under a BSD 3 license on GitHub (https://github.com/BoevaLab/sparsesurv) and The Python Package Index (PyPi) (https://pypi.org/project/sparsesurv/).

Read full abstract

BackgroundVarious biomarkers for prediction of distant metastasis in lymph-node negative breast cancer have been described; however, predictive biomarkers for patients with lymph-node positive (LNP) disease in the context of distinct systemic therapies are still very much needed. DNA methylation is aberrant in breast cancer and is likely to play a major role in disease progression. In this study, the DNA methylation status of 202 candidate loci was screened to identify those loci that may predict outcome in LNP/estrogen receptor-positive (ER+) breast cancer patients with adjuvant anthracycline-based chemotherapy.MethodsQuantitative bisulfite sequencing was used to analyze DNA methylation biomarker candidates in a retrospective cohort of 162 LNP/ER+ breast cancer patients, who received adjuvant anthracycline-based chemotherapy. First, twelve breast cancer specimens were analyzed for all 202 candidate loci to exclude genes that showed no differential methylation. To identify genes that predict distant metastasis, the remaining loci were analyzed in 84 selected cases, including the 12 initial ones. Significant loci were analyzed in the remaining 78 independent cases. Metastasis-free survival analysis was conducted by using Cox regression, time-dependent ROC analysis, and the Kaplan-Meier method. Pairwise multivariate regression analysis was performed by linear Cox Proportional Hazard models, testing the association between methylation scores and clinical parameters with respect to metastasis-free survival.ResultsOf the 202 loci analysed, 37 showed some indication of differential DNA methylation among the initial 12 patient samples tested. Of those, 6 loci were associated with outcome in the initial cohort (n = 84, log rank test, p < 0.05).Promoter DNA methylation of cysteine dioxygenase 1 (CDO1) was confirmed in univariate and in pairwise multivariate analysis adjusting for age at surgery, pathological T stage, progesterone receptor status, grade, and endocrine therapy as a strong and independent biomarker for outcome prediction in the independent validation set (log rank test p-value = 0.0010).ConclusionsCDO1 methylation was shown to be a strong predictor for distant metastasis in retrospective cohorts of LNP/ER+ breast cancer patients, who had received adjuvant anthracycline-based chemotherapy.

Read full abstract

Linear Cox Proportional Hazards Models Research Articles

Articles published on Linear Cox Proportional Hazards Models

Sparsesurv: a Python package for fitting sparse survival models via knowledge distillation.

Interpretable prognostic modeling of endometrial cancer

Cross-Phase Adversarial Domain Adaptation for Deep Disease-free Survival Prediction with Gastric Cancer CT Images.

Quasi-linear Cox proportional hazards model with cross- L1 penalty

DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network

Following the Organ Supply

CDO1 Promoter Methylation is a Biomarker for Outcome Prediction of Anthracycline Treated, Estrogen Receptor-Positive, Lymph Node-Positive Breast Cancer Patients

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Linear Cox Proportional Hazards Models Research Articles

Articles published on Linear Cox Proportional Hazards Models

Sparsesurv: a Python package for fitting sparse survival models via knowledge distillation.

Interpretable prognostic modeling of endometrial cancer

Cross-Phase Adversarial Domain Adaptation for Deep Disease-free Survival Prediction with Gastric Cancer CT Images.

Quasi-linear Cox proportional hazards model with cross- L1 penalty

DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network

Following the Organ Supply

CDO1 Promoter Methylation is a Biomarker for Outcome Prediction of Anthracycline Treated, Estrogen Receptor-Positive, Lymph Node-Positive Breast Cancer Patients