BMRF-MI: integrative identification of protein interaction network by modeling the gene dependency.

Xu Shi,Ayesha Shajahan,Xiao Wang,Leena Hilakivi-Clarke,Robert Clarke,Jianhua Xuan

doi:10.1186/1471-2164-16-s7-s10

Abstract

BackgroundIdentification of protein interaction network is a very important step for understanding the molecular mechanisms in cancer. Several methods have been developed to integrate protein-protein interaction (PPI) data with gene expression data for network identification. However, they often fail to model the dependency between genes in the network, which makes many important genes, especially the upstream genes, unidentified. It is necessary to develop a method to improve the network identification performance by incorporating the dependency between genes.ResultsWe proposed an approach for identifying protein interaction network by incorporating mutual information (MI) into a Markov random field (MRF) based framework to model the dependency between genes. MI is widely used in information theory to measure the uncertainty between random variables. Different from traditional Pearson correlation test, MI is capable of capturing both linear and non-linear relationship between random variables. Among all the existing MI estimators, we choose to use k-nearest neighbor MI (kNN-MI) estimator which is proved to have minimum bias. The estimated MI is integrated with an MRF framework to model the gene dependency in the context of network. The maximum a posterior (MAP) estimation is applied on the MRF-based model to estimate the network score. In order to reduce the computational complexity of finding the optimal network, a probabilistic searching algorithm is implemented. We further increase the robustness and reproducibility of the results by applying a non-parametric bootstrapping method to measure the confidence level of the identified genes. To evaluate the performance of the proposed method, we test the method on simulation data under different conditions. The experimental results show an improved accuracy in terms of subnetwork identification compared to existing methods. Furthermore, we applied our method onto real breast cancer patient data; the identified protein interaction network shows a close association with the recurrence of breast cancer, which is supported by functional annotation. We also show that the identified subnetworks can be used to predict the recurrence status of cancer patients by survival analysis.ConclusionsWe have developed an integrated approach for protein interaction network identification, which combines Markov random field framework and mutual information to model the gene dependency in PPI network. Improvements in subnetwork identification have been demonstrated with simulation datasets compared to existing methods. We then apply our method onto breast cancer patient data to identify recurrence related subnetworks. The experiment results show that the identified genes are enriched in the pathway and functional categories relevant to progression and recurrence of breast cancer. Finally, the survival analysis based on identified subnetworks achieves a good result of classifying the recurrence status of cancer patients.

Highlights

Identification of protein interaction network is a very important step for understanding the molecular mechanisms in cancer
In order to address the concern of gene interaction, Chen et al proposed a bagging Markov random field (BMRF) based method to improve the protein interaction subnetwork identification
BMRF employs maximum a posterior (MAP) principle to estimate the differential score of genes or proteins and form a novel network score that considers the pairwise gene interaction in the subnetworks

Summary

Introduction

Identification of protein interaction network is a very important step for understanding the molecular mechanisms in cancer. Several methods have been developed to integrate protein-protein interaction (PPI) data with gene expression data for network identification. They often fail to model the dependency between genes in the network, which makes many important genes, especially the upstream genes, unidentified. Several methods [3,4,5,6] have been developed to integrate protein-protein interaction (PPI) data with microarray gene expression data to identify significant protein interaction networks. In order to address the concern of gene interaction, Chen et al proposed a bagging Markov random field (BMRF) based method to improve the protein interaction subnetwork identification. We need to quantify the dependency between genes to reduce the negative effect of false connections and improve the performance

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genomics	Publication Date: Jun 11, 2015
Citations: 20	License type: cc-by

R Discovery Prime

R Discovery Prime

BMRF-MI: integrative identification of protein interaction network by modeling the gene dependency.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

Analysis of fMRI time series with mutual information
Vanessa Gómez-Verdejo ... Antonio Oliviero
Medical Image Analysis | VOL. 16
Vanessa Gómez-Verdejo, et. al.Vanessa Gómez-Verdejo ... Antonio Oliviero
25 Nov 2011
Medical Image Analysis | VOL. 16

Mutual information is critically dependent on prior assumptions: would the correct estimate of mutual information please identify itself?
Andrew D Fernandes ... Gregory B Gloor
Bioinformatics | VOL. 26
Andrew D Fernandes, et. al.Andrew D Fernandes ... Gregory B Gloor
17 Mar 2010
Bioinformatics | VOL. 26

Mutual information is critically dependent on prior assumptions: would the correct estimate of mutual information please identify itself?
A D Fernandes ... G B Gloor
Bioinformatics | VOL. 26
A D Fernandes, et. al.A D Fernandes ... G B Gloor
16 Sep 2010
Bioinformatics | VOL. 26

Estimation of mutual information by the fuzzy histogram
Maryam Amir Haeri ... Mohammad Mehdi Ebadzadeh
Fuzzy Optimization and Decision Making | VOL. 13
Maryam Amir Haeri, et. al.Maryam Amir Haeri ... Mohammad Mehdi Ebadzadeh
13 Feb 2014
Fuzzy Optimization and Decision Making | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

BMRF-MI: integrative identification of protein interaction network by modeling the gene dependency.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics