A New Method for Predicting the Subcellular Localization of Eukaryotic Proteins with Both Single and Multiple Sites: Euk-mPLoc 2.0

Kuo-Chen Chou,Hong-Bin Shen

doi:10.1371/journal.pone.0009931

Kuo-Chen Chou, Hong-Bin Shen

Open Access

https://doi.org/10.1371/journal.pone.0009931

Copy DOI

Abstract

Information of subcellular locations of proteins is important for in-depth studies of cell biology. It is very useful for proteomics, system biology and drug development as well. However, most existing methods for predicting protein subcellular location can only cover 5 to 12 location sites. Also, they are limited to deal with single-location proteins and hence failed to work for multiplex proteins, which can simultaneously exist at, or move between, two or more location sites. Actually, multiplex proteins of this kind usually posses some important biological functions worthy of our special notice. A new predictor called “Euk-mPLoc 2.0” is developed by hybridizing the gene ontology information, functional domain information, and sequential evolutionary information through three different modes of pseudo amino acid composition. It can be used to identify eukaryotic proteins among the following 22 locations: (1) acrosome, (2) cell wall, (3) centriole, (4) chloroplast, (5) cyanelle, (6) cytoplasm, (7) cytoskeleton, (8) endoplasmic reticulum, (9) endosome, (10) extracell, (11) Golgi apparatus, (12) hydrogenosome, (13) lysosome, (14) melanosome, (15) microsome (16) mitochondria, (17) nucleus, (18) peroxisome, (19) plasma membrane, (20) plastid, (21) spindle pole body, and (22) vacuole. Compared with the existing methods for predicting eukaryotic protein subcellular localization, the new predictor is much more powerful and flexible, particularly in dealing with proteins with multiple locations and proteins without available accession numbers. For a newly-constructed stringent benchmark dataset which contains both single- and multiple-location proteins and in which none of proteins has pairwise sequence identity to any other in a same location, the overall jackknife success rate achieved by Euk-mPLoc 2.0 is more than 24% higher than those by any of the existing predictors. As a user-friendly web-server, Euk-mPLoc 2.0 is freely accessible at http://www.csbio.sjtu.edu.cn/bioinf/euk-multi-2/. For a query protein sequence of 400 amino acids, it will take about 15 seconds for the web-server to yield the predicted result; the longer the sequence is, the more time it may usually need. It is anticipated that the novel approach and the powerful predictor as presented in this paper will have a significant impact to Molecular Cell Biology, System Biology, Proteomics, Bioinformatics, and Drug Development.

Highlights

With the avalanche of protein sequences generated in the postgenomic era, numerous efforts have been made to develop various methods for predicting protein subcellular localization based on the sequence information
As pointed out by Millar et al [13], recent evidences indicate that an increasing number of proteins have multiple locations in the cell
Its power mainly came from the GO approach because proteins formulated in the GO database space would be clustered in a manner much better reflecting the distribution of their subcellular locations, as elucidated in [18]

Summary

Introduction

With the avalanche of protein sequences generated in the postgenomic era, numerous efforts have been made to develop various methods for predicting protein subcellular localization based on the sequence information (see, e.g., [1,2,3,4,5,6,7,8] as well as a long list of references cited in two comprehensive review articles [9,10]).

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: Apr 1, 2010
Citations: 425	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A New Method for Predicting the Subcellular Localization of Eukaryotic Proteins with Both Single and Multiple Sites: Euk-mPLoc 2.0

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

ILoc-Plant: a multi-label classifier for predicting the subcellular localization of plant proteins with both single and multiple sites
Zhi-Cheng Wu ... Xuan Xiao
Molecular BioSystems | VOL. 7
Zhi-Cheng Wu, et. al.Zhi-Cheng Wu ... Xuan Xiao
01 Jan 2010
Molecular BioSystems | VOL. 7

ILoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites
Kuo-Chen Chou ... Zhi-Cheng Wu
Mol. BioSyst. | VOL. 8
Kuo-Chen Chou, et. al.Kuo-Chen Chou ... Zhi-Cheng Wu
01 Jan 2012
Mol. BioSyst. | VOL. 8

Plant-mPLoc: A Top-Down Strategy to Augment the Power for Predicting Plant Protein Subcellular Localization
Kuo-Chen Chou ... Hong-Bin Shen
PLoS ONE | VOL. 5
Kuo-Chen Chou, et. al.Kuo-Chen Chou ... Hong-Bin Shen
28 Jun 2010
PLoS ONE | VOL. 5

A Multi-Label Classifier for Predicting the Subcellular Localization of Gram-Negative Bacterial Proteins with Both Single and Multiple Sites
Xuan Xiao ... Zhi-Cheng Wu
PLoS ONE | VOL. 6
Xuan Xiao, et. al.Xuan Xiao ... Zhi-Cheng Wu
17 Jun 2011
PLoS ONE | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A New Method for Predicting the Subcellular Localization of Eukaryotic Proteins with Both Single and Multiple Sites: Euk-mPLoc 2.0

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE