Abstract

The complex cellular milieu can spontaneously demix, or phase separate, in a process controlled in part by intrinsically disordered (ID) proteins. A protein's propensity to phase separate is thought to be driven by a preference for protein–protein over protein–solvent interactions. The hydrodynamic size of monomeric proteins, as quantified by the polymer scaling exponent (v), is driven by a similar balance. We hypothesized that mean v, as predicted by protein sequence, would be smaller for proteins with a strong propensity to phase separate. To test this hypothesis, we analyzed protein databases containing subsets of proteins that are folded, disordered, or disordered and known to spontaneously phase separate. We find that the phase-separating disordered proteins, on average, had lower calculated values of v compared with their non-phase-separating counterparts. Moreover, these proteins had a higher sequence-predicted propensity for β-turns. Using a simple, surface area-based model, we propose a physical mechanism for this difference: transient β-turn structures reduce the desolvation penalty of forming a protein-rich phase and increase exposure of atoms involved in π/sp2 valence electron interactions. By this mechanism, β-turns could act as energetically favored nucleation points, which may explain the increased propensity for turns in ID regions (IDRs) utilized biologically for phase separation. Phase-separating IDRs, non-phase-separating IDRs, and folded regions could be distinguished by combining v and β-turn propensity. Finally, we propose a new algorithm, ParSe (partition sequence), for predicting phase-separating protein regions, and which is able to accurately identify folded, disordered, and phase-separating protein regions based on the primary sequence.

Highlights

  • Protein liquid-liquid phase separation (LLPS) is increasingly recognized as an important organizing phenomenon in cells

  • By normalizing hydrodynamic size to the chain length, the predicted polymer scaling exponent, v, provides a simple metric that reports on the net balance of self and solvent interactions

  • While there are real limitations to the applicability of applying concepts developed for long homopolymers to heteropolymeric proteins, numerous studies, including this work, support the view that properties, like v, derived for homopolymers, can be successfully applied to biological intrinsically disordered protein (IDP) to help understand their observed solution behavior [19, 24, 25, 31,32,33]

Read more

Summary

Introduction

Protein liquid-liquid phase separation (LLPS) is increasingly recognized as an important organizing phenomenon in cells. LLPS is a reversible process whereby complex protein mixtures spontaneously de-mix into liquid droplets that are enriched in a particular protein; concomitantly, surrounding regions are depleted of that protein [1]. This de-mixing transition is thought to provide temporal and spatial control over intracellular interactions by assembling collections of proteins into structures called membraneless organelles [2], a key step in the regulatory function of P bodies, the nucleolus, and germ granules [3,4,5]. In part, based on mechanistic insights into the nature of these interactions, several groups have developed sequence-based predictors to identify LLPS regions [10,11,12]

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call