Abstract

Protein motions are a key feature to understand biological function. Recently, a large-scale analysis of protein conformational diversity showed a positively skewed distribution with a peak at 0.5 Å C-alpha root-mean-square-deviation (RMSD). To understand this distribution in terms of structure-function relationships, we studied a well curated and large dataset of ~5,000 proteins with experimentally determined conformational diversity. We searched for global behaviour patterns studying how structure-based features change among the available conformer population for each protein. This procedure allowed us to describe the RMSD distribution in terms of three main protein classes sharing given properties. The largest of these protein subsets (~60%), which we call “rigid” (average RMSD = 0.83 Å), has no disordered regions, shows low conformational diversity, the largest tunnels and smaller and buried cavities. The two additional subsets contain disordered regions, but with differential sequence composition and behaviour. Partially disordered proteins have on average 67% of their conformers with disordered regions, average RMSD = 1.1 Å, the highest number of hinges and the longest disordered regions. In contrast, malleable proteins have on average only 25% of disordered conformers and average RMSD = 1.3 Å, flexible cavities affected in size by the presence of disordered regions and show the highest diversity of cognate ligands. Proteins in each set are mostly non-homologous to each other, share no given fold class, nor functional similarity but do share features derived from their conformer population. These shared features could represent conformational mechanisms related with biological functions.

Highlights

  • Crystallization studies on myoglobin found no apparent way the oxygen could possibly enter the molecule and bind to heme [1]

  • We have found that the distribution of conformational diversity in a large dataset of proteins could be explained in terms of three sets sharing structure-based features emerging from the conformer population for each protein

  • Using several structural and dynamical analyses over the available population of conformers for each protein, we found that three main structure-function relationships emerge

Read more

Summary

Introduction

Crystallization studies on myoglobin found no apparent way the oxygen could possibly enter the molecule and bind to heme [1]. RMSD or other structural similarity scores measure the differences in ordered parts of the proteins and stress the importance of protein motions in the known protein structure space From this distribution, it is possible to infer how a great majority of proteins have RMSD values compatible with the accepted error in estimating a structure using X-ray crystallography (about 0.4 Å). Comparisons of apo structures of the same protein show an RMSD of 0.5 Å, a value slightly below the observed apo and substrate-bound average In agreement with these results, large-scale protein motions are not necessary to sustain biological function in the majority of proteins studied. This observation is supported with the finding that even small changes between conformers could greatly affect catalytic parameters and biological behaviour of enzymes [13,14]. High RMSD values, are commonly observed in multidomain proteins where hinge motions produce relative movements of domains as rigid bodies [17]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call