Using Alias Sampling Strategy Based on Network Embeddings to Detect Protein Complexes

Xiaoxia Liu, Shengtian Sang, Xiaoxu Wang

doi:10.1109/access.2020.3040327

Open Access

Journal: IEEE Access
Publication Date: Jan 1, 2020
Citations: 35
License type: CC BY 4.0

Affiliation: Dalian Maritime University, Stanford University

Abstract

Detecting protein complexes from available protein-protein interaction (PPI) data helps to deepen our understanding of the mechanisms of biological activities. In recent years, various computational methods have been developed for identifying protein complexes from PPI networks. Almost all of the basic computational methods depend mainly on topological analysis of PPI networks. However, most of them fail to capture, at the same time, both the global and local topological structures of the PPI networks and the diversity of connectivity patterns between individual nodes. To solve this problem, we propose a node-embedding-based alias sampling extension method to detect protein complexes. More specifically, for a given set of seed nodes, the method first uses an alias sampling strategy based on protein node embedding similarities to select potential addable nodes. It then uses a new conductance measure, which can better quantify the likelihood of a subgraph being a protein complex, to decide whether to extend the current candidate subgraph. Evaluated on six real yeast PPI networks, our method outperforms state-of-the-art methods in detecting protein complexes. Furthermore, the experimental results demonstrate that the protein complexes predicted by our method have higher biological significance.
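As an illustration of the alias sampling step mentioned in the abstract, the following is a minimal sketch of the general alias method (Vose's variant), which draws from a discrete distribution in constant time after a linear-time setup. It is not the authors' implementation: the function names, the candidate protein names, and the similarity scores are all hypothetical, and the scores simply stand in for embedding similarities between a candidate subgraph and its neighboring nodes.

```python
import random


def build_alias_table(weights):
    """Build alias-method tables for sampling index i with probability
    proportional to weights[i] (weights must be non-negative, not all zero)."""
    n = len(weights)
    total = sum(weights)
    # Rescale so that the average entry is exactly 1.
    scaled = [w * n / total for w in weights]
    prob = [0.0] * n
    alias = [0] * n
    small = [i for i, p in enumerate(scaled) if p < 1.0]
    large = [i for i, p in enumerate(scaled) if p >= 1.0]
    while small and large:
        s = small.pop()
        l = large.pop()
        # Slot s keeps its own mass and borrows the remainder from l.
        prob[s] = scaled[s]
        alias[s] = l
        scaled[l] = scaled[l] + scaled[s] - 1.0
        (small if scaled[l] < 1.0 else large).append(l)
    # Leftover slots are full (up to rounding error).
    for i in large + small:
        prob[i] = 1.0
        alias[i] = i
    return prob, alias


def alias_draw(prob, alias, rng=random):
    """Draw one index in O(1): pick a slot uniformly, then keep it or
    jump to its alias."""
    i = rng.randrange(len(prob))
    return i if rng.random() < prob[i] else alias[i]


# Hypothetical candidates with made-up similarity scores (assumption:
# higher similarity to the current subgraph means higher sampling weight).
candidates = ["YAL001C", "YBR123W", "YCL045C"]
similarities = [0.2, 0.5, 0.3]
prob, alias = build_alias_table(similarities)
picked = candidates[alias_draw(prob, alias)]
```

The appeal of this design is that the tables are built once per distribution, so repeated draws of candidate nodes stay cheap even when a node has many neighbors.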

Highlights

A protein complex is a group of proteins that physically interact with one another to organize various biological processes in the cell.
In the Results section, we introduce the evaluation metrics and compare our method against four well-known complex detection approaches on six yeast protein-protein interaction (PPI) networks.
We found that our method outperforms six other state-of-the-art algorithms in identifying protein complexes.

Summary

Introduction

A protein complex is a group of proteins that physically interact with one another to organize various biological processes in the cell. The main line of approaches for identifying protein complexes from a PPI network is based on the observed inherent topological structures of protein complexes [4], [5]. Identifying protein complexes can be formulated as searching for subgraphs that are densely connected internally and well separated from the rest of the network. Building on this basic idea, detection methods based on machine learning and data mining have grown rapidly and become useful ways to identify protein complexes. Multiple studies have shown that incorporating well-selected additional biological information improves the performance of protein complex detection [6], [7]. We discuss only the methods that rely solely on topological characteristics of the network, since biological information can be added to most of these methods to improve their performance.
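The idea of a subgraph that is "densely connected internally and well separated from the rest of the network" is commonly quantified with conductance. The sketch below computes the standard textbook conductance of a node set in an undirected, unweighted graph; it is not the new conductance measure proposed in the paper, whose definition is not given here. The adjacency-dictionary representation, the function name, and the toy graph are assumptions made for illustration.

```python
def conductance(adj, subset):
    """Standard conductance of `subset` in an undirected graph.

    adj maps each node to the set of its neighbors (each edge stored in
    both directions). Returns cut(S, V \\ S) / min(vol(S), vol(V \\ S)),
    where vol is the sum of node degrees. Lower values indicate a set
    that is better separated from the rest of the graph.
    """
    subset = set(subset)
    cut = 0
    vol_in = 0
    vol_out = 0
    for u, nbrs in adj.items():
        deg = len(nbrs)
        if u in subset:
            vol_in += deg
            # Count edges leaving the subset.
            cut += sum(1 for v in nbrs if v not in subset)
        else:
            vol_out += deg
    denom = min(vol_in, vol_out)
    if denom == 0:
        raise ValueError("conductance is undefined when one side has zero volume")
    return cut / denom


# Toy graph (hypothetical): two triangles joined by the single edge c-d.
toy = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c", "e", "f"},
    "e": {"d", "f"},
    "f": {"d", "e"},
}
tight = conductance(toy, {"a", "b", "c"})   # one triangle
loose = conductance(toy, {"a", "d"})        # nodes from different triangles
```

In this toy graph the triangle scores lower than the mixed pair, which matches the intuition that a candidate complex should have few edges crossing its boundary relative to its total degree.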

