Abstract
Bioinformatics and biophysical studies of intrinsically disordered proteins and regions (IDRs) note the high entropy at individual sequence positions and in conformations sampled in solution. This prevents application of the canonical sequence-structure-function paradigm to IDRs and motivates the development of new methods to extract information from IDR sequences. We argue that the information in IDR sequences cannot be fully revealed through positional conservation, which largely measures stable structural contacts and interaction motifs. Instead, considerations of evolutionary conservation of molecular features can reveal the full extent of information in IDRs. Experimental quantification of the large conformational entropy of IDRs is challenging but can be approximated through the extent of conformational sampling measured by a combination of NMR spectroscopy and lower-resolution structural biology techniques, which can be further interpreted with simulations. Conformational entropy and other biophysical features can be modulated by post-translational modifications that provide functional advantages to IDRs by tuning their energy landscapes and enabling a variety of functional interactions and modes of regulation. The diverse mosaic of functional states of IDRs and their conformational features within complexes demands novel metrics of information, which will reflect the complicated sequence-conformational ensemble-function relationship of IDRs.
Highlights
We explore how conformational plasticity is utilized for the interactions of intrinsically disordered proteins and regions (IDRs) with their targets, and how it is impacted by post-translational modification and liquid-liquid phase separation
The apparent high entropy in the sequence space of IDRs points to the ineffectiveness of the positional sequence-structure paradigm for extracting functional information stored within IDRs
To fully appreciate the extent of information encoded in IDRs, we first must bypass the conventional measures of protein fold and search for new ways of extracting information
Summary
The concept of information is deeply rooted in the central dogma of molecular biology. Crick specified that information refers to “the precise determination of sequence, either of bases in the nucleic acid or of amino acid residues in the protein.” [1]. Since the discovery that the 3D structures of folded proteins are encoded in their amino acid sequence it has been understood that this genetic information translates to structure and function through a thermodynamic process that minimizes free energy, which for structured proteins. Entropy 2019, 21, 662 involves forming functional ordered conformations [3]. This structure-function relationship makes the determination of 3D structures of proteins a major focus of biological research [4] with structural information used to rationalize molecular mechanisms and therapeutic design [5]
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have