Abstract

BackgroundThe systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms.ResultsWe have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches.ConclusionThe integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.

Highlights

  • The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions

  • Gene Ontology (GO) [10] is another useful data source to combine with the protein interaction networks

  • We investigated whether each interacting pair appears in the annotation of the same functional and locational category

Read more

Summary

Introduction

The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. They have been limited in accuracy due to the presence of unreliable interactions and the complex connectivity patterns of the networks. The correlations of mRNA levels with even cognate protein expression may be modest at best These factors limit the usefulness of microarray data for assessing the reliability of protein-protein interactions. Gene Ontology (GO) [10] is another useful data source to combine with the protein interaction networks. The GO is currently one of the most comprehensive and well-curated ontology databases in the bioinformatics community It provides a collection of well-defined biological terms, called GO terms, spanning biological processes, molecular functions and cellular components. The GO has been used to facilitate the analysis of gene expression data [1113]

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.