Discrimination of Thermophilic and Mesophilic Proteins Using Reduced Amino Acid Alphabets with n-Grams

Aydin Albayrak, Ugur O Sezerman

doi:10.2174/157489312800604435

Abstract

Protein thermostabilization has been the focus of recent research due to growing interest in producing enzymes that can operate at industrially beneficial temperatures. Understanding the determinants of thermostabilization at the level of sequence and structure is important for designing such enzymes. A bioinformatics approach was used to determine the extent to which reduced amino acid alphabets (RAAA) with n-grams (subsequences of length n), subjected to a t-test-based feature selection procedure, can be used to discriminate proteins from thermophiles and mesophiles. The classification performance of 65 different protein alphabets with 3 different n-gram sizes was systematically evaluated using support vector machines on a test set that contained 707 proteins from mesophilic Xylella fastidiosa and thermophilic Aquifex aeolicus. A classification accuracy of 91.796% was achieved with the Hsdm16 RAAA with 13 features: EK-ILV-ST-A-G-F-H-Q-N-R-M-W-Y. The t-test-based feature selection procedure reduced the classification time without significantly affecting classification accuracy. The overall combination of methods in this paper is useful and computationally fast for classifying protein sequences from thermophiles and mesophiles using sequence information alone.
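To make the feature construction concrete, the following is a minimal Python sketch of how a sequence might be rewritten in a reduced alphabet and turned into n-gram frequencies. It is an illustration under stated assumptions, not the authors' implementation: the groups are copied from the feature string quoted in the abstract, each group is represented by its first letter, and residues outside those groups are mapped to a placeholder `X`. The function names `reduce_sequence` and `ngram_frequencies` are hypothetical.

```python
from collections import Counter

# Groups copied from the feature string in the abstract. Each group is
# represented by its first letter (an assumption of this sketch).
GROUPS = "EK-ILV-ST-A-G-F-H-Q-N-R-M-W-Y".split("-")
RESIDUE_TO_SYMBOL = {aa: group[0] for group in GROUPS for aa in group}


def reduce_sequence(seq, mapping=RESIDUE_TO_SYMBOL, unknown="X"):
    """Rewrite a protein sequence in the reduced alphabet.

    Residues not covered by the mapping become `unknown` (assumption).
    """
    return "".join(mapping.get(aa, unknown) for aa in seq.upper())


def ngram_frequencies(seq, n):
    """Return relative frequencies of overlapping n-grams in `seq`."""
    counts = Counter(seq[i:i + n] for i in range(len(seq) - n + 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gram: count / total for gram, count in counts.items()}


if __name__ == "__main__":
    reduced = reduce_sequence("MKVLST")
    print(reduced)
    print(ngram_frequencies(reduced, 2))
```

Frequency vectors of this kind, one per protein, could then be filtered feature by feature (for example with a two-sample t-test between the thermophilic and mesophilic groups) before being passed to a support vector machine; the exact settings used in the paper are not given in the abstract.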

Journal: Current Bioinformatics
Publication Date: May 1, 2012
Citations: 5

Similar Papers

Statistical Analysis of the Role of Cavity Flexibility in Thermostability of Proteins.
So Yeon Hong ... Jeong Chan Joo
Polymers | VOL. 16
21 Jan 2024

Identifying the Mesophilic and Thermophilic Proteins from Their Amino Acid Composition with ν-Support Vector Machines
Y R Ding ... J Sun
Journal of Algorithms & Computational Technology | VOL. 4
01 Sep 2010

Discrimination of Thermophilic and Mesophilic Proteins via Artificial Neural Networks
Jingru Xu ... Yuehui Chen
01 Jan 2010

Effective factors in thermostability of thermophilic proteins
M Sadeghi ... B Ranjbar
Biophysical Chemistry | VOL. 119
25 Oct 2005
