OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data

Xiaoyu Zhang,Yike Guo,Kai Sun,Yuting Xing

doi:10.3390/cancers13123047

Xiaoyu Zhang, Yike Guo + Show 2 more

Open Access

https://doi.org/10.3390/cancers13123047

Copy DOI

Journal: Cancers	Publication Date: Jun 18, 2021
Citations: 53	License type: CC BY 4.0

Affiliation: Imperial College London, Hong Kong Baptist University

Abstract

Simple SummaryOmiEmbed is a unified multi-task deep learning framework for multi-omics data, supporting dimensionality reduction, multi-omics integration, tumour type classification, phenotypic feature reconstruction and survival prediction. The framework is comprised of deep embedding and downstream task modules to capture biomedical information from high-dimensional omics data. OmiEmbed outperformed state-of-the-art methods on all three types of downstream tasks: classification, regression and survival prediction. Better performance was achieved using the multi-task training strategy compared to training each downstream task individually. With the multi-task strategy, OmiEmbed learnt a comprehensive omics embedding containing information from multiple tasks. OmiEmbed is open source, well-organised and convenient to be extended to other customised input data, network structures and downstream tasks, which has promising potential to facilitate more accurate and personalised clinical decision making.High-dimensional omics data contain intrinsic biomedical information that is crucial for personalised medicine. Nevertheless, it is challenging to capture them from the genome-wide data, due to the large number of molecular features and small number of available samples, which is also called “the curse of dimensionality” in machine learning. To tackle this problem and pave the way for machine learning-aided precision medicine, we proposed a unified multi-task deep learning framework named OmiEmbed to capture biomedical information from high-dimensional omics data with the deep embedding and downstream task modules. The deep embedding module learnt an omics embedding that mapped multiple omics data types into a latent space with lower dimensionality. Based on the new representation of multi-omics data, different downstream task modules were trained simultaneously and efficiently with the multi-task strategy to predict the comprehensive phenotype profile of each sample. OmiEmbed supports multiple tasks for omics data including dimensionality reduction, tumour type classification, multi-omics integration, demographic and clinical feature reconstruction, and survival prediction. The framework outperformed other methods on all three types of downstream tasks and achieved better performance with the multi-task strategy compared to training them individually. OmiEmbed is a powerful and unified framework that can be widely adapted to various applications of high-dimensional omics data and has great potential to facilitate more accurate and personalised clinical decision making.

Highlights

This article is an open access articleWith the increasingly massive amount of omics data generated from emerging highthroughput technologies, the large-scale, cost-efficient and comprehensive analysis of biological molecules becomes an everyday methodology for biomedical researchers [1,2].The analysis and assessment of different types of omics data facilitate the integration of molecular features during the standard diagnostic procedure
The OmiEmbed multi-omics multi-task framework was built on the deep learning library PyTorch [29]
The detailed network structures of both the FC-type and convolutional neural network (CNN)-type deep embedding modules were illustrated in Supplementary

Summary

Introduction

This article is an open access articleWith the increasingly massive amount of omics data generated from emerging highthroughput technologies, the large-scale, cost-efficient and comprehensive analysis of biological molecules becomes an everyday methodology for biomedical researchers [1,2].The analysis and assessment of different types of omics data facilitate the integration of molecular features during the standard diagnostic procedure. Tumours [3], an integrative method combining both histopathology and molecular information was recommended for the identification of multiple tumour entities. Most of these molecular features designed to aid diagnosis are manually selected biomarkers referring to specific genetic alterations, which neglects the genome-wide patterns correlated with disease prognosis and other phenotypic outcomes. Instead of focusing on the effect of specific molecular features, many researchers began to delve into the overall picture of genome-wide omics data and try to obtain the deep understanding of diseases and uncover crucial diagnostic or prognostic information from it [4,5,6,7].

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Cancers

Lead the way for us

Similar Papers

Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification
Xiaoyu Zhang ... Yike Guo
-
Xiaoyu Zhang, et. al.Xiaoyu Zhang ... Yike Guo
01 Nov 2019
01 Nov 2019

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification
Sayed Hashim ... Muhammad Ali
-
Sayed Hashim, et. al.Sayed Hashim ... Muhammad Ali
13 May 2022
13 May 2022

Quality control plot for high dimensional omics data
Gyu-Tae Kim ... Yongkang Kim
-
Gyu-Tae Kim, et. al. Gyu-Tae Kim ... Yongkang Kim
01 Dec 2016
01 Dec 2016

A Unified Multi-task Adversarial Learning Framework for Pharmacovigilance Mining
Shweta Yadav ... Asif Ekbal
-
Shweta Yadav, et. al.Shweta Yadav ... Asif Ekbal
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

OmiEmbed: A Unified Multi-Task Deep Learning Framework for Multi-Omics Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Cancers