CondiS: A conditional survival distribution-based method for censored data imputation overcoming the hurdle in machine learning-based survival analysis

Yizhuo Wang,Christopher R Flowers,Ziyi Li,Xuelin Huang

doi:10.1016/j.jbi.2022.104117

Yizhuo Wang, Christopher R Flowers + Show 2 more

Open Access

https://doi.org/10.1016/j.jbi.2022.104117

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Data analyses by machine learning (ML) algorithms are gaining popularity in biomedical research. When time-to-event data are of interest, censoring is common and needs to be properly addressed. Most ML methods cannot conveniently and appropriately take the censoring information into consideration, potentially leading to inaccurate or biased results. We aim to develop a general-purpose method for imputing censored survival data, facilitating downstream ML analysis. In this study, we propose a novel method of imputing the survival times for censored observations. The proposal is based on their conditional survival distributions (CondiS) derived from Kaplan-Meier estimators. CondiS can replace censored observations with their best approximations from the statistical model, allowing for direct application of ML methods. When covariates are available, we extend CondiS by incorporating the covariate information through ML modeling (CondiS-X), which further improves the accuracy of the imputed survival time. Compared with existing methods with similar purposes, the proposed methods achieved smaller prediction errors and higher concordance with the underlying true survival times in extensive simulation studies. We also demonstrated the usage and advantages of the proposed methods through two real-world cancer datasets. The major advantage of CondiS is that it allows for the direct application of standard ML techniques for analysis once the censored survival times are imputed. We present a user-friendly R package to implement our method, which is a useful tool for ML-based biomedical research in this era of big data.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Biomedical Informatics	Publication Date: Jun 9, 2022
Citations: 5	License type: publisher-specific-oa

R Discovery Prime

CondiS: A conditional survival distribution-based method for censored data imputation overcoming the hurdle in machine learning-based survival analysis

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics

Lead the way for us

Similar Papers

Context- and Physiology-aware Machine Learning for Upper-Limb Myocontrol
Gauravkumar K Patel
-
Gauravkumar K PatelGauravkumar K Patel
21 Feb 2022
21 Feb 2022

P125. Development of a novel ensemble machine learning algorithm for prediction of complications and readmission after anterior cervical spinal fusion
Akash A Shah ... Nelson Soohoo
The Spine Journal | VOL. 21
Akash A Shah, et. al.Akash A Shah ... Nelson Soohoo
10 Aug 2021
The Spine Journal | VOL. 21

P126. Development of a novel ensemble machine learning algorithm for prediction of complications and readmission after posterior cervical spinal fusion
Akash A Shah ... Nelson Soohoo
The Spine Journal | VOL. 21
Akash A Shah, et. al.Akash A Shah ... Nelson Soohoo
10 Aug 2021
The Spine Journal | VOL. 21

Applications of machine learning in friction stir welding: Prediction of joint properties, real-time control and tool failure diagnosis
Ammar H Elsheikh
Engineering Applications of Artificial Intelligence | VOL. 121
Ammar H ElsheikhAmmar H Elsheikh
14 Feb 2023
Engineering Applications of Artificial Intelligence | VOL. 121

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

CondiS: A conditional survival distribution-based method for censored data imputation overcoming the hurdle in machine learning-based survival analysis

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Biomedical Informatics