Abstract
Proteins are typically represented by discrete atomic coordinates providing an accessible framework to describe different conformations. However, in some fields proteins are more accurately represented as near-continuous surfaces, as these are imprinted with geometric (shape) and chemical (electrostatics) features of the underlying protein structure. Protein surfaces are dependent on their chemical composition and, ultimately determine protein function, acting as the interface that engages in interactions with other molecules. In the past, such representations were utilized to compare protein structures on global and local scales and have shed light on functional properties of proteins. Here we describe RosettaSurf, a surface-centric computational design protocol, that focuses on the molecular surface shape and electrostatic properties as means for protein engineering, offering a unique approach for the design of proteins and their functions. The RosettaSurf protocol combines the explicit optimization of molecular surface features with a global scoring function during the sequence design process, diverging from the typical design approaches that rely solely on an energy scoring function. With this computational approach, we attempt to address a fundamental problem in protein design related to the design of functional sites in proteins, even when structurally similar templates are absent in the characterized structural repertoire. Surface-centric design exploits the premise that molecular surfaces are, to a certain extent, independent of the underlying sequence and backbone configuration, meaning that different sequences in different proteins may present similar surfaces. We benchmarked RosettaSurf on various sequence recovery datasets and showcased its design capabilities by generating epitope mimics that were biochemically validated. Overall, our results indicate that the explicit optimization of surface features may lead to new routes for the design of functional proteins.
Highlights
Proteins are key components in living cells, performing many functions that commonly rely on physical interactions between molecules
The RosettaSurf protocol operates at the solvent-excluded surface level of a protein structure, and its core operation is the comparison between surfaces
We refer to the mutable surface as target while the reference surface used for comparisons is denoted as reference
Summary
Proteins are key components in living cells, performing many functions that commonly rely on physical interactions between molecules. While discrete atomic-level protein representations have been invaluable for our understanding of protein function, nearcontinuous surface-based representations offer the opportunity to study protein structures using a different subset of features (e.g. electrostatic potentials, geometry). In 1971, Lee and Richards introduced the concept of solvent-accessible surfaces, which in practice are generated by rolling a probe approximating a solvent molecule over the protein atoms [2]. Numerical representations of surfaces have been developed, ranging from dot surfaces, to voxel representations and graphs [1,8–12]. These representations allow the mapping of molecular and geometric properties onto the generated surface, including physicochemical properties (such as electrostatics and hydrophobicity), and geometrical features (e.g. protrusions or cavities) [6,8,13–15]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.