Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification

Yin Guo,Menglan Cai,Limin Li,Huiran Li

doi:10.1186/s12920-019-0633-1

Yin Guo, Menglan Cai + Show 2 more

Open Access

https://doi.org/10.1186/s12920-019-0633-1

Copy DOI

Journal: BMC Medical Genomics	Publication Date: Dec 1, 2019
Citations: 4	License type: open-access

Affiliation: Xi'an Jiaotong University

Abstract

BackgroundRecent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Computational analysis of the multi-omics datasets could potentially reveal deep insights for a given disease. Most existing clustering methods by multi-omics data assume strong consistency among different sources of datasets, and thus may lose efficacy when the consistency is relatively weak. Furthermore, they could not identify the conflicting parts for each view, which might be important in applications such as cancer subtype identification.MethodsIn this work, we propose an integrative subspace clustering method (ISC) by common and specific decomposition to identify clustering structures with multi-omics datasets. The main idea of our ISC method is that the original representations for the samples in each view could be reconstructed by the concatenation of a common part and a view-specific part in orthogonal subspaces. The problem can be formulated as a matrix decomposition problem and solved efficiently by our proposed algorithm.ResultsThe experiments on simulation and text datasets show that our method outperforms other state-of-art methods. Our method is further evaluated by identifying cancer types using a colorectal dataset. We finally apply our method to cancer subtype identification for five cancers using TCGA datasets, and the survival analysis shows that the subtypes we found are significantly better than other compared methods.ConclusionWe conclude that our ISC model could not only discover the weak common information across views but also identify the view-specific information.

Highlights

Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets
The log-rank p-values for all the methods are reported in Table 6. we can see from the table that, for four cancers including glioblastoma multiforme (GBM), breast invasive carcinoma (BIC), Kidney cancer (KRCCC), and lung squamous cell carcinoma (LSCC), our integrative subspace clustering method (ISC) method could obtain the most significant p-values
The subtypes for GBM and KRCCC found by the common part across three views obtain the most significant pvalues, the BIC subtypes found by miRNA expression are the most significant, and the subtypes for LSCC found by DNA methylation are the most significant

Summary

Introduction

Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Most existing clustering methods by multi-omics data assume strong consistency among different sources of datasets, and may lose efficacy when the consistency is relatively weak. They could not identify the conflicting parts for each view, which might be important in applications such as cancer subtype identification. Most molecular studies of subtype identification for breast cancer integrate genomic, epigenomic, and transcriptomic profiling including mRNA expression profiling, miRNA expression, DNA methylation and DNA copy number analysis, and so on. Multi-view clustering takes information from all views into account such that better clustering structures could be discovered

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Genomics

Lead the way for us

Similar Papers

Autoencoder-assisted latent representation learning for survival prediction and multi-view clustering on multi-omics cancer subtyping.
Shuwei Zhu ... Wei Fang
Mathematical biosciences and engineering : MBE | VOL. 20
Shuwei Zhu, et. al.Shuwei Zhu ... Wei Fang
01 Jan 2023
Mathematical biosciences and engineering : MBE | VOL. 20

PCA-constrained multi-core matrix fusion network: A novel approach for cancer subtype identification.
Min Li ... Shaobo Deng
Journal of bioinformatics and computational biology | VOL. 22
Min Li, et. al.Min Li ... Shaobo Deng
01 Aug 2024
Journal of bioinformatics and computational biology | VOL. 22

Abstract 7566: Identification of cancer subtypes with a ctDNA-based targeted methylation assay
Tracy Nance ... Charles Swanton
Cancer Research | VOL. 84
Tracy Nance, et. al.Tracy Nance ... Charles Swanton
22 Mar 2024
Abstract 7566: Identification of cancer subtypes with a ctDNA-based targeted methylation assay
Tracy Nance ... Charles Swanton

Supervised Graph Clustering for Cancer Subtyping Based on Survival Analysis and Integration of Multi-Omic Tumor Data.
Cheng Liu ... Wenming Cao
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 19
Cheng Liu, et. al.Cheng Liu ... Wenming Cao
21 Jul 2020
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Medical Genomics