Using protein granularity to extract the protein sequence features

Zhi-Xin Liu, Song-Lei Liu, Hong-Qiang Yang, Li-Hua Bao

doi:10.1016/j.jtbi.2013.04.019

Abstract

Extracting features from protein sequences is a challenging problem that can require theoretical and practical knowledge from many fields, and it becomes harder when the features must be derived from the sequences alone. In this paper, we present the protein granularity method and define the concepts of protein granularity, granularity order, granularity bound, granularity limit, and granularity increment. Protein granularity extracts useful information from protein sequences alone. We provide an approach for constructing feature vectors that include amino acid composition information, sequence-order information, same amino acid ‘neighbor’ information, and sequence length information, so the feature vectors represent protein sequences better. The feature extraction method explicitly accounts for the effect of protein sequence length. We carried out a protein structure class prediction experiment. The prediction achieved 96.6% overall accuracy, with success rates of 92.3% for all-α, 100% for all-β, 100% for α/β, and 93.5% for α+β. The last three subset success rates equal the best success rates in the published literature. The overall accuracy of the PG-SVM prediction is the second best reported, differing from the best result by a single misclassified protein. The theoretical and experimental results demonstrate that protein granularity succeeds in extracting features from protein sequences.
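To make the kinds of information listed in the abstract concrete, here is a minimal Python sketch that builds a toy feature vector from a sequence alone. It is an illustration under stated assumptions, not the paper's method: the abstract does not give the definitions of protein granularity, granularity order, or the other concepts, so none of them are implemented here. The function names (`composition`, `same_neighbor_fraction`, `toy_features`) and the choice of "fraction of adjacent identical residues" as a stand-in for same amino acid 'neighbor' information are hypothetical, and sequence-order information is not modeled at all.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so every vector is aligned.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """Return the fraction of each standard amino acid in seq (20 values)."""
    counts = Counter(seq)
    n = len(seq)
    return [counts[a] / n for a in AMINO_ACIDS]


def same_neighbor_fraction(seq):
    """Return the fraction of adjacent residue pairs that are identical.

    Hypothetical stand-in for 'neighbor' information; the paper's actual
    definition is not stated in the abstract.
    """
    if len(seq) < 2:
        return 0.0
    same = sum(1 for a, b in zip(seq, seq[1:]) if a == b)
    return same / (len(seq) - 1)


def toy_features(seq):
    """Concatenate composition, neighbor fraction, and sequence length."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must be non-empty")
    return composition(seq) + [same_neighbor_fraction(seq), float(len(seq))]


if __name__ == "__main__":
    vector = toy_features("AACA")
    print(len(vector), vector[0], vector[-1])
```

A vector of this kind could be passed to any standard classifier; the abstract's PG-SVM result refers to the authors' own granularity-based features, which this sketch does not reproduce.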

Journal: Journal of Theoretical Biology
Publication Date: Apr 26, 2013
Citations: 4

Similar Papers

Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition
J.-Y Shi ... J Xie
Amino Acids | VOL. 33
19 Jan 2007

Using Amino Acid Physicochemical Distance Transformation for Fast Protein Remote Homology Detection
Bin Liu ... Xiaolong Wang
PLoS ONE | VOL. 7
28 Sep 2012

Discriminating lysosomal membrane protein types using dynamic neural network
Vijay Tripathi ... Dwijendra Kumar Gupta
Journal of Biomolecular Structure and Dynamics | VOL. 32
22 Aug 2013

Structural class prediction of protein using novel feature extraction method from chaos game representation of predicted secondary structure
Lichao Zhang ... Jinfeng Lv
Journal of Theoretical Biology | VOL. 400
12 Apr 2016
