Abstract

Detecting protein complexes from protein-protein interaction (PPI) networks is a challenging task in computational biology. A vast number of computational methods have been proposed to undertake this task. However, each computational method is developed to capture one aspect of the network. The performance of different methods on the same network can differ substantially, even the same method may have different performance on networks with different topological characteristic. The clustering result of each computational method can be regarded as a feature that describes the PPI network from one aspect. It is therefore desirable to utilize these features to produce a more accurate and reliable clustering. In this paper, a novel Bayesian Nonnegative Matrix Factorization(NMF)-based weighted Ensemble Clustering algorithm (EC-BNMF) is proposed to detect protein complexes from PPI networks. We first apply different computational algorithms on a PPI network to generate some base clustering results. Then we integrate these base clustering results into an ensemble PPI network, in the form of weighted combination. Finally, we identify overlapping protein complexes from this network by employing Bayesian NMF model. When generating an ensemble PPI network, EC-BNMF can automatically optimize the values of weights such that the ensemble algorithm can deliver better results. Experimental results on four PPI networks of Saccharomyces cerevisiae well verify the effectiveness of EC-BNMF in detecting protein complexes. EC-BNMF provides an effective way to integrate different clustering results for more accurate and reliable complex detection. Furthermore, EC-BNMF has a high degree of flexibility in the choice of base clustering results. It can be coupled with existing clustering methods to identify protein complexes.

Highlights

  • Protein-protein interactions (PPI) are fundamental to the biological processes within cells [1]

  • By applying EC-Bayesian NMF model (BNMF) on four yeast PPI networks, we show that EC-BNMF has competitive performance with the state-of-the-art algorithms in detecting protein complexes

  • We hope that the ensemble PPI network can approximate the intrinsic of the original PPI network, we propose an alternative approach by assuming that the ensemble PPI network is a weighted combination of these feature networks

Read more

Summary

Introduction

Protein-protein interactions (PPI) are fundamental to the biological processes within cells [1]. Protein complexes can help us to predict the functions of proteins [3,4]. In the post-genomic era, predicting protein complexes is crucial. To address this problem, several biological experimental methods have been developed for detecting protein complexes. As mentioned in [2,8], these methods have some inevitable limitations such as too much time consuming. Due to these experimental limitations, it is quite necessary to develop computational approaches which can be acted as useful complements to the experimental methods for detecting protein complexes

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.