Abstract

The focus of the computational structural biology community has taken a dramatic shift over the past one-and-a-half decades from the classical protein structure prediction problem to the possible understanding of intrinsically disordered proteins (IDP) or proteins containing regions of disorder (IDPR). The current interest lies in the unraveling of a disorder-to-order transitioning code embedded in the amino acid sequences of IDPs/IDPRs. Disordered proteins are characterized by an enormous amount of structural plasticity which makes them promiscuous in binding to different partners, multi-functional in cellular activity and atypical in folding energy landscapes resembling partially folded molten globules. Also, their involvement in several deadly human diseases (e.g. cancer, cardiovascular and neurodegenerative diseases) makes them attractive drug targets, and important for a biochemical understanding of the disease(s). The study of the structural ensemble of IDPs is rather difficult, in particular for transient interactions. When bound to a structured partner, an IDPR adapts an ordered conformation in the complex. The residues that undergo this disorder-to-order transition are called protean residues, generally found in short contiguous stretches and the first step in understanding the modus operandi of an IDP/IDPR would be to predict these residues. There are a few available methods which predict these protean segments from their amino acid sequences; however, their performance reported in the literature leaves clear room for improvement. With this background, the current study presents ‘Proteus’, a random forest classifier that predicts the likelihood of a residue undergoing a disorder-to-order transition upon binding to a potential partner protein. The prediction is based on features that can be calculated using the amino acid sequence alone. Proteus compares favorably with existing methods predicting twice as many true positives as the second best method (55 vs. 27%) with a much higher precision on an independent data set. The current study also sheds some light on a possible ‘disorder-to-order’ transitioning consensus, untangled, yet embedded in the amino acid sequence of IDPs. Some guidelines have also been suggested for proceeding with a real-life structural modeling involving an IDPR using Proteus.

Highlights

  • After extensive research over one-and-a-half decades, it is evident that many functional proteins lack well-folded 3D structures

  • A comparative study of these properties in predicted disordered and annotated protean segments will serve to explore and identify empirical trends in the designed features and thereby act as a guide in determining the features that are more discriminative compared to the features that can act as filters

  • The results indicate that the intrinsic disorder associated with the unbound protean segments potentially suffers from the indecisiveness of the mainchain trajectory to adapt a particular secondary structure

Read more

Summary

Introduction

After extensive research over one-and-a-half decades, it is evident that many functional proteins lack well-folded 3D structures. J Comput Aided Mol Des (2017) 31:453–466 concomitantly while being synthesized [7, 8], these proteins are born disordered [3] and remain either completely or partially unstructured throughout their entire life span It is only when they interact with functionally relevant binding partners that they switch to ordered structures [4]. It is highly likely that these peculiar characteristics may be attributed to their non-native-like multi-funneled and relatively flat energy landscapes [12, 13], wherein the favored conformations closely resemble to the partially folded molten globules [13] which enable them to preserve the necessary amount of disorder even in their bound forms [4] Considering this flexible nature, they have been referred to as part of the ‘edge of chaos’ systems [14], serving as a bridge between well-ordered and chaotic system that is critical in the context of cellular energy balance

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call