Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information.

Ge Wang, Yu-Jia Zhai, Zhen-Zhen Xue, Ying-Ying Xu

doi:10.3390/biom11111607

Abstract

The subcellular locations of proteins are closely related to their functions. Over the past few decades, applying machine learning algorithms to predict protein subcellular locations has been an important topic in proteomics. However, most studies in this field have used only amino acid sequences as the data source, and only a few have considered other types of protein data. For example, three-dimensional structures, which contain far more functional information than sequences, remain largely unexplored. In this work, we extracted various handcrafted features that describe protein structures from physical, chemical, and topological aspects, together with learned features obtained by deep neural networks. We then used these features to classify protein subcellular locations. Our experimental results demonstrated that some of these structural features contribute to protein location classification and can help improve the performance of sequence-based location predictors. Our method provides a new perspective for analyzing the spatial distribution of proteins and is expected to help reveal the relationships between protein structures and functions.
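
As a rough illustration of how structure-derived features might complement a sequence-based predictor, the sketch below concatenates two per-protein feature vectors (early fusion) and classifies with a nearest-centroid rule. This is not the authors' pipeline: the fusion strategy, the classifier, the function names (`fuse`, `fit_centroids`, `predict`), and all numbers are assumptions chosen only to keep the example small and self-contained.

```python
from math import dist


def fuse(seq_feats, struct_feats):
    """Early fusion: concatenate sequence and structure feature vectors."""
    return list(seq_feats) + list(struct_feats)


def fit_centroids(vectors, labels):
    """Average the training vectors of each class into one centroid."""
    sums, counts = {}, {}
    for vec, label in zip(vectors, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def predict(centroids, vec):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: dist(centroids[label], vec))


# Toy, made-up feature values (hypothetical, not data from the paper).
seq_train = [[0.9, 0.1], [0.8, 0.2], [0.2, 0.9], [0.1, 0.8]]
struct_train = [[0.3], [0.2], [0.8], [0.9]]
labels = ["nucleus", "nucleus", "cytoplasm", "cytoplasm"]

train = [fuse(s, t) for s, t in zip(seq_train, struct_train)]
centroids = fit_centroids(train, labels)

# The sequence part of this query sits midway between the two classes;
# the appended structure feature is what separates them.
query = fuse([0.5, 0.5], [0.85])
prediction = predict(centroids, query)
print(prediction)
```

In this toy setup the query's sequence features alone are ambiguous, and the single structure feature moves the fused vector toward the "cytoplasm" centroid. A real study would replace the toy vectors with extracted descriptors and the nearest-centroid rule with a properly validated classifier.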

Highlights

Given that subcellular/organelle structures in cells provide specific physiological and functional environments, the determination of the subcellular locations of proteins is believed to be an important aspect of the understanding of their functions [1,2].
Some prediction methods, such as Hum-mPLoc 3.0 [4] and SCLpred [5], constructed sequence features through a target signal search, motif analysis, or homology transfer, while some works in recent years, like DeepLoc [6] and HumDLoc [7], employed deep learning models to learn the protein features automatically.
In order to test the ability of the above descriptors to distinguish subcellular protein locations, we used t-distributed stochastic neighbor embedding (t-SNE) to visualize

Summary

Introduction

Given that subcellular/organelle structures in cells provide specific physiological and functional environments, determining the subcellular locations of proteins is considered an important aspect of understanding their functions [1,2]. The theoretical basis of computational prediction is that a protein is transported into specific subcellular structure(s) according to its signal peptide, a short segment within the amino acid sequence. Some prediction methods, such as Hum-mPLoc 3.0 [4] and SCLpred [5], constructed sequence features through target signal search, motif analysis, or homology transfer, while more recent works, such as DeepLoc [6] and HumDLoc [7], employed deep learning models to learn protein features automatically.

Journal: Biomolecules
Publication Date: Oct 29, 2021
Citations: 4
License type: CC BY 4.0
