Abstract

There are many major diseases that remain incurable, such as cancer and Alzheimer’s disease (AD). Therefore, the prevention for these diseases has more impact than diagnosis and treatment. Survival analysis aims at predicting the time of occurrence of specific events of interest. It can be used to identify patients of high risk, which helps healthcare system effectively allocate limited medical resources. In many real-world applications, such as healthcare analysis, a lot of datasets are collected from multiple data sources and exhibit a block-wise missing pattern, i.e., each patient takes different types of tests and receives various treatments, and each test/treatment associates with a set of corresponding features. However, the existing survival analysis methods such as the Cox proportional hazards model and its extensions are designed for fully observed datasets and can not be directly applied when such block-wise missing pattern presents. This paper addresses this challenge and enables the survival prediction models to deal with high-dimensional multi-source block-wise missing data. Specifically, we employ a partition method that decomposes the multi-source block-wise missing data into multiple completed sub-matrix; thus, the original problem is transformed into a series of related multi-source survival analysis problems. To deal with these problems, we propose a two-layer multi-task learning framework that achieves both feature-level and source-level analysis, and the proposed framework is able to take advantage of the structure information in the block-wise missing pattern. Based on the proposed framework, we formulate two concrete models for survival analysis to handle multi-source block-wise missing data. We apply the proposed method in the real-world benchmark Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset and The Cancer Genome Atlas (TCGA) dataset to study the stage conversion of AD patients and survival time of cancer patients, respectively. The results from our comprehensive evaluations show that our methods outperform the state-of-the-art methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.