Subtype classification and heterogeneous prognosis model construction in precision medicine.

Na You,Heping Zhang,Shun He,Xueqin Wang,Junxian Zhu

doi:10.1111/biom.12843

Abstract

Common diseases including cancer are heterogeneous. It is important to discover disease subtypes and identify both shared and unique risk factors for different disease subtypes. The advent of high-throughput technologies enriches the data to achieve this goal, if necessary statistical methods are developed. Existing methods can accommodate both heterogeneity identification and variable selection under parametric models, but for survival analysis, the commonly used Cox model is semiparametric. Although finite-mixture Cox model has been proposed to address heterogeneity in survival analysis, variable selection has not been incorporated into such semiparametric models. Using regularization regression, we propose a variable selection method for the finite-mixture Cox model and select important, subtype-specific risk factors from high-dimensional predictors. Our estimators have oracle properties with proper choices of penalty parameters under the regularization regression. An expectation-maximization algorithm is developed for numerical calculation. Simulations demonstrate that our proposed method performs well in revealing the heterogeneity and selecting important risk factors for each subtype, and its performance is compared to alternatives with other regularizers. Finally, we apply our method to analyze a gene expression dataset for ovarian cancer DNA repair pathways. Based on our selected risk factors, the prognosis model accounting for heterogeneity consistently improves the prediction for the survival probability in both training and test datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Subtype classification and heterogeneous prognosis model construction in precision medicine.

Abstract

Talk to us

Similar Papers

More From: Biometrics

Lead the way for us

Journal: Biometrics	Publication Date: Jan 22, 2018
Citations: 5

Similar Papers

A split-and-conquer variable selection approach for high-dimensional general semiparametric models with massive data
Jianglin Fang
Journal of Multivariate Analysis | VOL. 194
Jianglin FangJianglin Fang
17 Nov 2022
Journal of Multivariate Analysis | VOL. 194

Functional index coefficient models with variable selection
Zongwu Cai ... Bingduo Yang
Journal of Econometrics | VOL. 189
Zongwu Cai, et. al.Zongwu Cai ... Bingduo Yang
19 Mar 2015
Journal of Econometrics | VOL. 189

Adaptive Rejection Metropolis Simulated Annealing for Detecting Global Maximum Regions
Huaiye Zhang ... Inyoung Kim
Methodology and Computing in Applied Probability | VOL. 18
Huaiye Zhang, et. al.Huaiye Zhang ... Inyoung Kim
21 Feb 2014
Methodology and Computing in Applied Probability | VOL. 18

Bayesian Variable Selection Regression of Multivariate Responses for Group Data
B Liquet ... K Mengersen
Bayesian Analysis | VOL. 12
B Liquet, et. al.B Liquet ... K Mengersen
01 Dec 2017
Bayesian Analysis | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Subtype classification and heterogeneous prognosis model construction in precision medicine.

Abstract

Talk to us

Similar Papers

More From: Biometrics