Abstract

BackgroundIdentifying protein complexes is an essential task for understanding the mechanisms of proteins in cells. Many computational approaches have thus been developed to identify protein complexes in protein-protein interaction (PPI) networks. Regarding the information that can be adopted by computational approaches to identify protein complexes, in addition to the graph topology of PPI network, the consideration of functional information of proteins has been becoming popular recently. Relevant approaches perform their tasks by relying on the idea that proteins in the same protein complex may be associated with similar functional information. However, we note from our previous researches that for most protein complexes their proteins are only similar in specific subsets of categories of functional information instead of the entire set. Hence, if the preference of each functional category can also be taken into account when identifying protein complexes, the accuracy will be improved.ResultsTo implement the idea, we first introduce a preference vector for each of proteins to quantitatively indicate the preference of each functional category when deciding the protein complex this protein belongs to. Integrating functional preferences of proteins and the graph topology of PPI network, we formulate the problem of identifying protein complexes into a constrained optimization problem, and we propose the approach DCAFP to address it. For performance evaluation, we have conducted extensive experiments with several PPI networks from the species of Saccharomyces cerevisiae and Human and also compared DCAFP with state-of-the-art approaches in the identification of protein complexes. The experimental results show that considering the integration of functional preferences and dense structures improved the performance of identifying protein complexes, as DCAFP outperformed the other approaches for most of PPI networks based on the assessments of independent measures of f-measure, Accuracy and Maximum Matching Rate. Furthermore, the function enrichment experiments indicated that DCAFP identified more protein complexes with functional significance when compared with approaches, such as PCIA, that also utilize the functional information.ConclusionsAccording to the promising performance of DCAFP, the integration of functional preferences and dense structures has made it possible to identify protein complexes more accurately and significantly.

Highlights

  • Identifying protein complexes is an essential task for understanding the mechanisms of proteins in cells

  • We chose to use more than one protein-protein interaction (PPI) networks of Saccharomyces cerevisiae as they were all different in terms of unreliability resulted from different PPI identification processes [37]

  • Comparing DCAFP with approaches that only considered the graph topology of PPI network, we found that DCAFP achieved a better performance than most of them for each of PPI networks as it made use of functional information to improve the performance of identifying protein complexes

Read more

Summary

Introduction

Identifying protein complexes is an essential task for understanding the mechanisms of proteins in cells. Regarding the information that can be adopted by computational approaches to identify protein complexes, in addition to the graph topology of PPI network, the consideration of functional information of proteins has been becoming popular recently. Relevant approaches perform their tasks by relying on the idea that proteins in the same protein complex may be associated with similar functional information. Protein complexes discovered in PPI networks can lead to a better understanding of the roles of proteins in different cellular systems It is for this reason that the problem of identifying protein complexes has been being popular over the last decade. For laboratory-based techniques the set of protein complexes that can be identified by them is usually incomplete, as some protein complexes may not be able to be discovered under current experimental conditions [9]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call