Identifying Protein Subcellular Location with Embedding Features Learned from Networks

Hongwei Liu, Lin Lu, Bin Hu, Lei Chen

doi:10.2174/1570164617999201124142950

Abstract

Background: Identifying protein subcellular location is an important problem because subcellular location is closely related to protein function. Determining locations with biological experiments is fundamental, but such experiments are costly and time-consuming. An alternative is to design effective computational methods.

Objective: Several computational methods have been proposed to date. However, these methods mainly adopt features derived from the proteins themselves. Meanwhile, with the development of network techniques, several embedding algorithms have been proposed that encode the nodes of a network into feature vectors. Such algorithms connect networks with traditional classification algorithms and thus provide a new way to construct models for predicting protein subcellular location.

Methods: In this study, we analyzed features produced by three network embedding algorithms (DeepWalk, Node2vec and Mashup) applied to one or multiple protein networks. The resulting features were fed to a machine learning algorithm (support vector machine or random forest) to construct the model. All constructed models were evaluated with cross-validation.

Results: Under cross-validation, the embedding features yielded by Mashup on multiple networks were quite informative for predicting protein subcellular location. The model based on these features was superior to some classic models.

Conclusion: Embedding features yielded by a proper and powerful network embedding algorithm were effective for building models to predict protein subcellular location, providing new pipelines for building more efficient models.
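The pipeline the abstract describes (network-derived node features, a classifier, cross-validation) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a tiny hypothetical network with made-up location labels, uses random walk with restart diffusion vectors as node features (the diffusion stage that Mashup-style methods start from, with the dimensionality reduction step omitted), swaps in a 1-nearest-neighbor classifier for the support vector machine or random forest, and uses leave-one-out as the cross-validation scheme. All names and parameter values are illustrative assumptions.

```python
import math

# Toy undirected "protein network": two 4-node cliques joined by one bridge edge.
# Node ids, edges, and location labels are hypothetical illustration data.
EDGES = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3),
         (4, 5), (4, 6), (4, 7), (5, 6), (5, 7), (6, 7),
         (3, 4)]
LABELS = ["nucleus"] * 4 + ["cytoplasm"] * 4


def build_adjacency(edges, n):
    """Return a neighbor list for each of the n nodes."""
    neighbors = [[] for _ in range(n)]
    for a, b in edges:
        neighbors[a].append(b)
        neighbors[b].append(a)
    return neighbors


def rwr_features(neighbors, start, restart=0.5, steps=100):
    """Random walk with restart from `start`, by power iteration.

    Returns the visiting distribution over all nodes, used here as the
    feature vector of `start`. Assumes every node has at least one neighbor.
    """
    n = len(neighbors)
    p = [0.0] * n
    p[start] = 1.0
    for _ in range(steps):
        nxt = [0.0] * n
        for j, nbrs in enumerate(neighbors):
            # Node j passes its non-restart mass evenly to its neighbors.
            share = (1.0 - restart) * p[j] / len(nbrs)
            for k in nbrs:
                nxt[k] += share
        nxt[start] += restart  # restart mass returns to the start node
        p = nxt
    return p


def loo_accuracy(features, labels):
    """Leave-one-out accuracy of a 1-nearest-neighbor classifier
    (a simple stand-in for SVM or random forest)."""
    correct = 0
    for i, x in enumerate(features):
        nearest = min((j for j in range(len(features)) if j != i),
                      key=lambda j: math.dist(x, features[j]))
        correct += labels[nearest] == labels[i]
    return correct / len(features)


neighbors = build_adjacency(EDGES, len(LABELS))
features = [rwr_features(neighbors, i) for i in range(len(LABELS))]
accuracy = loo_accuracy(features, LABELS)
print(f"leave-one-out accuracy: {accuracy:.2f}")
```

With several networks, one would compute a feature vector per network for each protein and combine them before classification. How they are combined is a design choice that this sketch leaves open.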

Journal: Current Proteomics
Publication Date: Nov 23, 2021
Citations: 31

Similar Papers

Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.
Bo Li ... Lijun Cai
Molecules | VOL. 24
06 Mar 2019

Prediction of human protein subcellular localization using deep learning
Leyi Wei ... Quan Zou
Journal of Parallel and Distributed Computing | VOL. 117
24 Aug 2017

Using Nearest Feature Line and Tunable Nearest Neighbor methods for prediction of protein subcellular locations
Qing-Bin Gao ... Zheng-Zhi Wang
Computational Biology and Chemistry | VOL. 29
01 Oct 2005

DeepLoc: prediction of protein subcellular localization using deep learning.
José Juan Almagro Armenteros ... Henrik Nielsen
Bioinformatics | VOL. 33
07 Jul 2017
