Relevant and Non-Redundant Amino Acid Sequence Selection for Protein Functional Site Identification

Chandra Das,Pradipta Maji

doi:10.4018/jssci.2010040102

Abstract

In order to apply a powerful pattern recognition algorithm to predict functional sites in proteins, amino acids cannot be used directly as inputs since they are non-numerical variables. Therefore, they need encoding prior to input. In this regard, the bio-basis function maps a non-numerical sequence space to a numerical feature space. One of the important issues for the bio-basis function is how to select a minimum set of bio-basis strings with maximum information. In this paper, an efficient method to select bio-basis strings for the bio-basis function is described integrating the concepts of the Fisher ratio and “degree of resemblance”. The integration enables the method to select a minimum set of most informative bio-basis strings. The “degree of resemblance” enables efficient selection of a set of distinct bio-basis strings. In effect, it reduces the redundant features in numerical feature space. Quantitative indices are proposed for evaluating the quality of selected bio-basis strings. The effectiveness of the proposed bio-basis string selection method, along with a comparison with existing methods, is demonstrated on different data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Relevant and Non-Redundant Amino Acid Sequence Selection for Protein Functional Site Identification

Abstract

Talk to us

Similar Papers

More From: International Journal of Software Science and Computational Intelligence

Lead the way for us

Similar Papers

Efficient Design of Bio-Basis Function to Predict Protein Functional Sites Using Kernel-Based Classifiers
P Maji ... C Das
IEEE Transactions on NanoBioscience | VOL. 9
P Maji, et. al.P Maji ... C Das
30 Sep 2010
IEEE Transactions on NanoBioscience | VOL. 9

Protein Functional Sites Prediction Using Modified Bio-Basis Function and Quantitative Indices
P Maji ... C Das
IEEE Transactions on NanoBioscience | VOL. 9
P Maji, et. al.P Maji ... C Das
01 Dec 2010
IEEE Transactions on NanoBioscience | VOL. 9

Rough-Fuzzy C-Medoids Algorithm and Selection of Bio-Basis for Amino Acid Sequence Analysis
...
IEEE Transactions on Knowledge and Data Engineering | VOL. 19
, et. al. ...
01 Jun 2007
IEEE Transactions on Knowledge and Data Engineering | VOL. 19

Prediction of Protein Functional Sites Using Novel String Kernels
Chandra Das ... Pradipta Maji
-
Chandra Das, et. al.Chandra Das ... Pradipta Maji
01 Dec 2008
01 Dec 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Relevant and Non-Redundant Amino Acid Sequence Selection for Protein Functional Site Identification

Abstract

Talk to us

Similar Papers

More From: International Journal of Software Science and Computational Intelligence