MVDA: a multi-view genomic data integration methodology.

Angela Serra,Giancarlo Raiconi,Michele Fratello,Roberto Tagliaferri,Dario Greco,Vittorio Fortino

doi:10.1186/s12859-015-0680-3

Abstract

BackgroundMultiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications.ResultsWe propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results.ConclusionOur observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/.Electronic supplementary materialThe online version of this article (doi:10.1186/s12859-015-0680-3) contains supplementary material, which is available to authorized users.

Highlights

Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals
We compared it with recently developed methods: the integrative clustering algorithm, namely SNF [9]. and the Tw-Kmeans [7], an early integration multi-view clustering model
Our unsupervised method shows a mean error of 27,47 %, normalized mutual information (NMI) of 28 % and stability of 85 %

Summary

Results

We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. We found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results

Conclusion

Background

Results and Discussion

Conclusions

Methods

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Aug 19, 2015
Citations: 96	License type: cc-by

R Discovery Prime

R Discovery Prime

MVDA: a multi-view genomic data integration methodology.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Prior Information Biases Stimulus Representations during Vibrotactile Decision Making
Claudia Preuschhof ... Hauke R Heekeren
Journal of Cognitive Neuroscience | VOL. 22
Claudia Preuschhof, et. al.Claudia Preuschhof ... Hauke R Heekeren
01 May 2010
Journal of Cognitive Neuroscience | VOL. 22

Five omic technologies are concordant in differentiating the biochemical characteristics of the berries of five grapevine (Vitis vinifera L.) cultivars.
Ryan Ghan ... Karen A Schlauch
BMC Genomics | VOL. 16
Ryan Ghan, et. al.Ryan Ghan ... Karen A Schlauch
16 Nov 2015
BMC Genomics | VOL. 16

Application of Bayesian genomic prediction methods to genome-wide association analyses
Anna Wolc ... Jack C M Dekkers
Genetics Selection Evolution | VOL. 54
Anna Wolc, et. al.Anna Wolc ... Jack C M Dekkers
13 May 2022
Genetics Selection Evolution | VOL. 54

ColoWeb: a resource for analysis of colocalization of genomic features.
Ryangguk Kim ... Mirit I Aladjem
BMC Genomics | VOL. 16
Ryangguk Kim, et. al.Ryangguk Kim ... Mirit I Aladjem
28 Feb 2015
BMC Genomics | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MVDA: a multi-view genomic data integration methodology.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics