Clustering multilayer omics data using MuNCut

Sebastian J Teran Hidalgo, Shuangge Ma

doi:10.1186/s12864-018-4580-6

Open Access

Journal: BMC Genomics	Publication Date: Mar 14, 2018
Citations: 13	License type: open-access

Affiliation: Yale University, Taiyuan University of Technology

Abstract

Background: Omics profiling is now a routine component of biomedical studies. In the analysis of omics data, clustering is an essential step that serves multiple purposes, including, for example, revealing the unknown functionalities of omics units and assisting dimension reduction in outcome model building. In the most recent omics studies, a prominent trend is to conduct multilayer profiling, which collects multiple types of genetic, genomic, epigenetic and other measurements on the same subjects. In the literature, clustering methods tailored to multilayer omics data are still limited. Directly applying existing clustering methods to multilayer omics data, or clustering each layer first and then combining across layers, is “suboptimal” in that neither approach accommodates the interconnections within layers and across layers in an informative way.

Methods: In this study, we develop the MuNCut (Multilayer NCut) clustering approach. It is tailored to multilayer omics data and sufficiently accounts for both across- and within-layer connections. It is based on the novel NCut technique and also takes advantage of regularized sparse estimation. It has an intuitive formulation and is computationally feasible. To facilitate implementation, we develop the function muncut in the R package NcutYX.

Results: Under a wide spectrum of simulation settings, MuNCut outperforms competing methods. The analysis of TCGA (The Cancer Genome Atlas) data on breast cancer and cervical cancer shows that MuNCut generates biologically meaningful results which differ from those obtained using the alternatives.

Conclusions: We propose a more effective clustering analysis of multiple omics data. It provides a new avenue for jointly analyzing genetic, genomic, epigenetic and other measurements.

Highlights

Omics profiling is a routine component of biomedical studies
The bottom layer consists of copy number variations (CNVs), the middle layer consists of gene expressions (GEs), and the upper layer consists of proteins
A small number of CNVs in the bottom layer regulate a small number of GEs in the middle layer, which encode a small number of proteins in the upper layer

Summary

Introduction

Omics profiling is a routine component of biomedical studies. In the analysis of omics data, clustering is an essential step that serves multiple purposes, including, for example, revealing the unknown functionalities of omics units and assisting dimension reduction in outcome model building. Applying existing clustering methods directly to multilayer omics data, or clustering each layer first and then combining across layers, is “suboptimal” in that neither approach accommodates the interconnections within layers and across layers in an informative way. Clustering results can be used in multiple ways. They can suggest the unknown functionalities of omics units, with those in the same cluster likely to have related biological functions [1].
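To make the point about cross-layer interconnections concrete, the following is a minimal sketch of the standard normalized cut (NCut) criterion evaluated on a toy three-layer network. It is not the MuNCut objective itself, which also involves regularized sparse estimation. The function name `ncut`, the node names, and all similarity weights are assumptions chosen purely for illustration. The toy network has two modules, each linking one CNV to one GE to one protein with strong weights, plus weak links between modules.

```python
def ncut(weights, labels):
    """Standard normalized cut of a partition.

    weights: symmetric n x n list of lists of non-negative similarities.
    labels:  cluster label for each of the n nodes.
    Returns the sum over clusters of cut(cluster, rest) / vol(cluster).
    """
    n = len(weights)
    total = 0.0
    for k in set(labels):
        cut = 0.0  # weight on edges leaving cluster k
        vol = 0.0  # total weight attached to nodes in cluster k
        for i in range(n):
            if labels[i] != k:
                continue
            for j in range(n):
                vol += weights[i][j]
                if labels[j] != k:
                    cut += weights[i][j]
        if vol > 0:
            total += cut / vol
    return total


# Hypothetical toy network: two CNVs, two GEs, two proteins.
names = ["cnv1", "cnv2", "ge1", "ge2", "prot1", "prot2"]
index = {name: i for i, name in enumerate(names)}
W = [[0.0] * len(names) for _ in names]


def link(a, b, w):
    """Add an undirected edge with weight w (made-up values)."""
    W[index[a]][index[b]] = w
    W[index[b]][index[a]] = w


# Strong links within each CNV -> GE -> protein module.
link("cnv1", "ge1", 1.0)
link("ge1", "prot1", 1.0)
link("cnv2", "ge2", 1.0)
link("ge2", "prot2", 1.0)
# Weak links between the two modules.
link("cnv1", "ge2", 0.1)
link("cnv2", "ge1", 0.1)
link("ge1", "prot2", 0.1)
link("ge2", "prot1", 0.1)

by_module = [0, 1, 0, 1, 0, 1]  # clusters span all three layers
by_layer = [0, 0, 1, 1, 2, 2]   # clusters coincide with the layers

score_module = ncut(W, by_module)
score_layer = ncut(W, by_layer)
print("NCut, clusters across layers:", score_module)
print("NCut, one cluster per layer: ", score_layer)
```

In this toy setting the partition that follows the cross-layer modules has a much lower NCut value than the partition that groups units by layer, because every edge in the network runs between layers. A method that ignores across-layer connections cannot see this structure.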
