Abstract

BackgroundThe accumulation of various multi-omics data and computational approaches for data integration can accelerate the development of precision medicine. However, the algorithm development for multi-omics data integration remains a pressing challenge.ResultsHere, we propose a multi-omics data integration algorithm based on random walk with restart (RWR) on multiplex network. We call the resulting methodology Random Walk with Restart for multi-dimensional data Fusion (RWRF). RWRF uses similarity network of samples as the basis for integration. It constructs the similarity network for each data type and then connects corresponding samples of multiple similarity networks to create a multiplex sample network. By applying RWR on the multiplex network, RWRF uses stationary probability distribution to fuse similarity networks. We applied RWRF to The Cancer Genome Atlas (TCGA) data to identify subtypes in different cancer data sets. Three types of data (mRNA expression, DNA methylation, and microRNA expression data) are integrated and network clustering is conducted. Experiment results show that RWRF performs better than single data type analysis and previous integrative methods.ConclusionsRWRF provides powerful support to users to decipher the cancer molecular subtypes, thus may benefit precision treatment of specific patients in clinical practice.

Highlights

  • The accumulation of various multi-omics data and computational approaches for data integration can accelerate the development of precision medicine

  • We proposed a multi-omics data integration algorithm based on random walk with restart (RWR) on multiplex network

  • Overview RWRF and RWRNF are two multi-omics data integration algorithms based on random walk with restart on multiplex network

Read more

Summary

Introduction

The accumulation of various multi-omics data and computational approaches for data integration can accelerate the development of precision medicine. The algorithm development for multi-omics data integration remains a pressing challenge. The rapid development of biotechnology enables researchers to generate multiple types of biomedical data. Compared with methods that use only a single data type, data integration approach enables more comprehensive and informative analysis of biomedical data. Integrating multiple data types can compensate for missing or unreliable information in any single data type, and multiple sources of evidence pointing to the same result are less likely to lead to false positives. Algorithms for integrating multi-omics or multi-dimensional biomedical data become indispensable key technologies for multiomics research and new algorithms are increasingly needed

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.