An efficient computational intelligence technique for classification of protein sequences

Muhammad Javed Iqbal,Ibrahima Faye,Abas Md Said,Brahim Belhaouari Samir

doi:10.1109/iccoins.2014.6868352

Abstract

Many artificial intelligence techniques have been developed to process the constantly increasing volume of data to extract meaningful information from it. The accurate annotation of the unknown protein using the classification of the protein sequence into an existing superfamily is considered a critical and challenging task in bioinformatics and computational biology. This classification would be helpful in the analysis and modeling of unknown protein to determine their structure and function. In this paper, a frequency-based feature encoding technique has been used in the proposed framework to represent amino acids of a protein's primary sequence. The technique has considered the occurrence frequency of each amino acid in a sequence. Popular classification algorithms such as decision tree, naive Bayes, neural network, random forest and support vector machine have been employed to evaluate the effectiveness of the encoding method utilized in the proposed framework. Results have indicated that the decision tree classifier significantly shows better results in terms of classification accuracy, specificity, sensitivity, F-measure, etc. The classification accuracy of 88.7% was achieved over the Yeast protein sequence data taken from the well-known UniProtKB database.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An efficient computational intelligence technique for classification of protein sequences

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Data Mining of Protein Sequences with Amino Acid Position-Based Feature Encoding Technique
Muhammad Javed Iqbal ... Ibrahima Faye
-
Muhammad Javed Iqbal, et. al.Muhammad Javed Iqbal ... Ibrahima Faye
15 Dec 2013
15 Dec 2013

A novel semi-supervised approach for protein sequence classification
Bharti Chaturvedi ... Nagamma Patil
-
Bharti Chaturvedi, et. al.Bharti Chaturvedi ... Nagamma Patil
01 Jun 2015
01 Jun 2015

Predictive analysis for pathogenicity classification of H5Nx avian influenza strains using machine learning techniques.
Akshay Chadha ... Rozita Dara
Preventive Veterinary Medicine | VOL. 216
Akshay Chadha, et. al.Akshay Chadha ... Rozita Dara
01 Jul 2023
Preventive Veterinary Medicine | VOL. 216

IDM-PhyChm-Ens: Intelligent decision-making ensemble methodology for classification of human breast cancer using physicochemical properties of amino acids
Safdar Ali ... Asifullah Khan
Amino Acids | VOL. 46
Safdar Ali, et. al.Safdar Ali ... Asifullah Khan
04 Jan 2014
Amino Acids | VOL. 46

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An efficient computational intelligence technique for classification of protein sequences

Abstract

Talk to us

Similar Papers