Abstract

Identifying protein complexes in protein-protein interaction (ppi) networks is often handled as a community detection problem, with algorithms generally relying exclusively on the network topology for discovering a solution. The advancement of experimental techniques on ppi has motivated the generation of many Gene Ontology (go) databases. Incorporating the functionality extracted from go with the topological properties from the underlying ppi network yield a novel approach to identify protein complexes. Additionally, most of the existing algorithms use global measures that operate on the entire network to identify communities. The result of using global metrics are large communities that are often not correlated with the functionality of the proteins. Moreover, ppi network analysis shows that most of the biological functions possibly lie between local neighbours in ppi networks, which are not identifiable with global metrics. In this paper, we propose a local community detection algorithm, (lcda-go), that uniquely exploits information of functionality from go combined with the network topology. lcda-go identifies the community of each protein based on the topological and functional knowledge acquired solely from the local neighbour proteins within the ppi network. Experimental results using the Krogan dataset demonstrate that our algorithm outperforms in most cases state-of-the-art approaches in assessment based on Precision, Sensitivity, and particularly Composite Score. We also deployed lcda, the local-topology based precursor of lcda-go, to compare with a similar state-of-the-art approach that exclusively incorporates topological information of ppi networks for community detection. In addition to the high quality of the results, one main advantage of lcda-go is its low computational time complexity.

Highlights

  • Proteins work cooperatively to accomplish biological functions

  • Identifying protein complexes is an important step for biological knowledge discovery since several biological processes are accomplished in the formation of protein complexes

  • We propose a local community detection algorithm, LCDA-GO, for protein complexes by exploiting Gene Ontology (GO)

Read more

Summary

Introduction

Proteins work cooperatively to accomplish biological functions. The physical interaction between proteins, known as protein-protein interaction (PPI), is the key for many biological functions [1], for example, the transcription of DNA, the translation of mRNA, and cell cycle [2]. The first term refers to the physical interaction describing the arrangements of the nodes in the network, and is associated with the densely connected proteins namely communities The latter explains the biological function of proteins that are achieved by groups of proteins that bind each other and shape protein complexes. The state-of-the-art solutions consider different objectives to divide the nodes of a given network into highly interconnected communities [18,19,20] Some of these algorithms are adjusted to biological networks to tackle the protein complex detection in PPI networks [21], including C-FINDER, COACH, CLUSTERONE, MCL, CMC, MCODE, and CORE&PEEL. We propose LCDA-GO, a local community detection algorithm that combines topological and functional properties (i.e., GO terms) of PPI networks to detect associated communities that are representing protein complexes. We have used the LCDA algorithm [29], the localtopology based precursor of LCDA-GO

Related work
A PPI network
Experiments and results
Evaluation metrics
Discussion and conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.