Weighted amino acid composition based on amino acid indices for prediction of protein structural classes

Sundeep Singh Nanuwa, Huseyin Seker, Andre Dziurla

doi:10.1109/itab.2009.5394398

Abstract

Prediction of protein structural classes is one of the most important and challenging tasks in the bioinformatics field. A protein is classified into one of the four main structural classes: all-α, all-β, α/β and α+β. This paper investigates the role of amino acid indices (AAI) combined with traditional amino acid composition (AAC) to create a weighted amino acid composition (WAAC) feature-set for predicting the structural class of a protein. There are over 500 amino acid indices that can be used to develop the novel WAAC feature-set, which has great potential to increase the accuracy of protein structural class prediction. To evaluate these indices, a high-quality 40% homology dataset is used that contains over 7000 protein sequences (the largest of its kind) extracted from proteomic databases. The predictive technique developed is an optimum k-nearest-neighbour classifier, named multiple-k-nearest-neighbour (MKNN). The classifier is evaluated with a 10-fold cross-validation test procedure throughout the study. Over 1 million analyses were carried out. The highest accuracy, 48.35%, was obtained with index LEVM780101, which is 9% higher than traditional AAC and 6.6% higher than the best sequence-driven feature subset used in other studies. There is great potential for further improvement, as WAAC is a feature-set with the least number of attributes and uses no feature selection. Of the 548 amino acid indices analysed in this study, 536 yielded higher accuracies than traditional AAC and 435 yielded higher accuracies than other sequence-driven features.
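
The abstract does not give the WAAC formula, so the following is only a minimal sketch under one plausible reading: each amino acid's composition frequency is multiplied by that residue's value in a chosen amino acid index, giving a 20-attribute vector. The function names `aac` and `waac` are hypothetical, and the Kyte-Doolittle hydropathy scale (AAindex entry KYTJ820101) is used purely as a stand-in index; it is not the LEVM780101 index reported above.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues

# Kyte-Doolittle hydropathy scale (AAindex KYTJ820101).
# Stand-in index for illustration only; any index with 20 values would fit.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aac(sequence):
    """Traditional amino acid composition: frequency of each standard residue."""
    counts = Counter(sequence.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)  # non-standard symbols are ignored
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[a] / total for a in AMINO_ACIDS]


def waac(sequence, index):
    """Assumed WAAC: each composition frequency scaled by the residue's index value."""
    return [freq * index[a] for freq, a in zip(aac(sequence), AMINO_ACIDS)]


if __name__ == "__main__":
    features = waac("MKVLAAGIVGL", KYTE_DOOLITTLE)
    for residue, value in zip(AMINO_ACIDS, features):
        print(residue, round(value, 4))
```

Under this assumption the feature vector keeps the same 20 attributes as traditional AAC, which is consistent with the abstract's remark that WAAC has the least number of attributes. A vector like this could then be passed to any nearest-neighbour classifier; the MKNN procedure itself is not specified in the abstract and is not sketched here.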

Similar Papers

Investigation into effectiveness of rough sets in prediction of enzyme and protein structure classes
Chris Newby ... Yingjie Yang
01 Jun 2009

Prediction of protein structural class for low-similarity sequences using Chou’s pseudo amino acid composition and wavelet denoising
Bin Yu ... Baoguang Tian
Journal of Molecular Graphics and Modelling | VOL. 76
14 Jul 2017

Prediction of protein (domain) structural classes based on amino-acid index.
Wei‐Shu Bu ... Chun‐Ting Zhang
European Journal of Biochemistry | VOL. 266
15 Dec 1999

Prediction of protein structural classes by a new measure of information discrepancy
Lixia Jin ... Huanwen Tang
Computational Biology and Chemistry | VOL. 27
01 Jul 2003
