Sequence-driven features for prediction of subcellular localization of proteins

Jong Kyoung Kim,Sung-Yang Bang,Seungjin Choi

doi:10.1016/j.patcog.2006.02.021

Jong Kyoung Kim, Sung-Yang Bang + Show 1 more

https://doi.org/10.1016/j.patcog.2006.02.021

Copy DOI

Abstract

Prediction of the cellular location of a protein plays an important role in inferring the function of the protein. Feature extraction is a critical part in prediction systems, requiring raw sequence data to be transformed into appropriate numerical feature vectors while minimizing information loss. In this paper, we present a method for extracting useful features from protein sequence data. The method employs local and global pairwise sequence alignment scores as well as composition-based features. Five different features are used for training support vector machines (SVMs) separately and a weighted majority voting makes a final decision. The overall prediction accuracy evaluated by the 5-fold cross-validation reached 88.53% for the eukaryotic animal data set. Comparing the prediction accuracy of various feature extraction methods, provides a biological insight into the location of targeting information. Our experimental results confirm that our feature extraction methods are very useful for predicting subcellular localization of proteins.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sequence-driven features for prediction of subcellular localization of proteins

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Apr 17, 2006
Citations: 37

Similar Papers

Predicting the cofactors of oxidoreductases based on amino acid composition distribution and Chou's amphiphilic pseudo-amino acid composition
Guang-Ya Zhang ... Bai-Shan Fang
Journal of Theoretical Biology | VOL. 253
Guang-Ya Zhang, et. al.Guang-Ya Zhang ... Bai-Shan Fang
19 Mar 2008
Journal of Theoretical Biology | VOL. 253

Prediction of subcellular localization of proteins using pairwise sequence alignment and support vector machine
Jong Kyoung Kim ... Seungjin Choi
Pattern Recognition Letters | VOL. 27
Jong Kyoung Kim, et. al.Jong Kyoung Kim ... Seungjin Choi
03 Feb 2006
Pattern Recognition Letters | VOL. 27

The effect of three novel feature extraction methods on the prediction of the subcellular localization of multi-site virus proteins
Lei Wang ... Dong Wang
Bioengineered | VOL. 9
Lei Wang, et. al.Lei Wang ... Dong Wang
22 Nov 2017
Bioengineered | VOL. 9

Feature-Based Causal Structure Discovery in Protein and Gene Expression Data with Bayesian Network
Jingwei Liu ... Minping Qian
-
Jingwei Liu, et. al.Jingwei Liu ... Minping Qian
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sequence-driven features for prediction of subcellular localization of proteins

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition