A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins

Yu-Cheng Liu,Win-Li Lin,Meng-Han Yang,Chien-Kang Huang,Yen-Jen Oyang

doi:10.1186/1471-2164-10-s3-s22

Abstract

BackgroundProteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment. As it has been observed in laboratories that protein flexibility is correlated to essential biological functions, scientists have been designing various types of predictors for identifying structurally flexible regions in proteins. In this respect, there are two major categories of predictors. One category of predictors attempts to identify conformationally flexible regions through analysis of protein tertiary structures. Another category of predictors works completely based on analysis of the polypeptide sequences. As the availability of protein tertiary structures is generally limited, the design of predictors that work completely based on sequence information is crucial for advances of molecular biology research.ResultsIn this article, we propose a novel approach to design a sequence-based predictor for identifying conformationally ambivalent regions in proteins. The novelty in the design stems from incorporating two classifiers based on two distinctive supervised learning algorithms that provide complementary prediction powers. Experimental results show that the overall performance delivered by the hybrid predictor proposed in this article is superior to the performance delivered by the existing predictors. Furthermore, the case study presented in this article demonstrates that the proposed hybrid predictor is capable of providing the biologists with valuable clues about the functional sites in a protein chain. The proposed hybrid predictor provides the users with two optional modes, namely, the high-sensitivity mode and the high-specificity mode. The experimental results with an independent testing data set show that the proposed hybrid predictor is capable of delivering sensitivity of 0.710 and specificity of 0.608 under the high-sensitivity mode, while delivering sensitivity of 0.451 and specificity of 0.787 under the high-specificity mode.ConclusionThough experimental results show that the hybrid approach designed to exploit the complementary prediction powers of distinctive supervised learning algorithms works more effectively than conventional approaches, there exists a large room for further improvement with respect to the achieved performance. In this respect, it is of interest to investigate the effects of exploiting additional physiochemical properties that are related to conformational ambivalence. Furthermore, it is of interest to investigate the effects of incorporating lately-developed machine learning approaches, e.g. the random forest design and the multi-stage design. As conformational transition plays a key role in carrying out several essential types of biological functions, the design of more advanced predictors for identifying conformationally ambivalent regions in proteins deserves our continuous attention.

Highlights

Proteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment
Though experimental results show that the hybrid approach designed to exploit the complementary prediction powers of distinctive supervised learning algorithms works more effectively than conventional approaches, there exists a large room for further improvement with respect to the achieved performance
As conformational transition plays a key role in carrying out several essential types of biological functions, the design of more advanced predictors for identifying conformationally ambivalent regions in proteins deserves our continuous attention

Summary

Introduction

Proteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment. As it has been observed in laboratories that protein flexibility is correlated to essential biological functions, scientists have been designing various types of predictors for identifying structurally flexible regions in proteins. In this respect, there are two major categories of predictors. The GTPase HRas protein, whose gene serves as an oncogene of the bladder cancer, shows different conformations in the Switch II region when this protein switches between the RAS-GTP state and the RAS-GDP state [3,4,5,6]. The prion protein (PrP) causes the mad cow disease when a specific secondary structure element changes from a helix to a b-sheet [9]

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genomics	Publication Date: Dec 1, 2009
Citations: 39	License type: cc-by

R Discovery Prime

R Discovery Prime

A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

Phosphorylation-Induced Mechanical Regulation of Intrinsically Disordered Neurofilament Proteins
Eti Malka-Gibor ... Roy Beck
Biophysical Journal | VOL. 112
Eti Malka-Gibor, et. al.Eti Malka-Gibor ... Roy Beck
01 Mar 2017
Biophysical Journal | VOL. 112

Rigidity Analysis of Protein Molecules
Zahra Shahbazi ... Ahmet Demirtas
Journal of Computing and Information Science in Engineering | VOL. 15
Zahra Shahbazi, et. al.Zahra Shahbazi ... Ahmet Demirtas
01 Sep 2015
Journal of Computing and Information Science in Engineering | VOL. 15

Mitigating the Blurring Effect of CryoEM Averaging on a Flexible and Highly Symmetric Protein Complex through Sub-Particle Reconstruction.
Diana S Suder ... Shane Gonen
International journal of molecular sciences | VOL. 25
Diana S Suder, et. al.Diana S Suder ... Shane Gonen
23 May 2024
International journal of molecular sciences | VOL. 25

Protein loops with multiple meta‐stable conformations: A challenge for sampling and scoring methods
Amélie Barozet ... Marc Bianciotto
Proteins: Structure, Function, and Bioinformatics | VOL. 89
Amélie Barozet, et. al.Amélie Barozet ... Marc Bianciotto
12 Oct 2020
Proteins: Structure, Function, and Bioinformatics | VOL. 89

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics