Trajectory-based training enables protein simulations with accurate folding and Boltzmann ensembles in cpu-hours.

John M Jumper, Nabil F Faruk, Tobin R Sosnick, Karl F Freed

doi:10.1371/journal.pcbi.1006578

Open Access

Journal: PLOS Computational Biology
Publication Date: Dec 27, 2018
Citations: 40
License type: CC BY 4.0

Affiliation: University of Chicago

Abstract

An ongoing challenge in protein chemistry is to identify the underlying interaction energies that capture protein dynamics. The traditional trade-off in biomolecular simulation between accuracy and computational efficiency is predicated on the assumption that detailed force fields are typically well-parameterized, obtaining a significant fraction of possible accuracy. We re-examine this trade-off in the more realistic regime in which parameterization is a greater source of error than the level of detail in the force field. To address parameterization of coarse-grained force fields, we use the contrastive divergence technique from machine learning to train from simulations of 450 proteins. In our procedure, the computational efficiency of the model enables high accuracy through the precise tuning of the Boltzmann ensemble. This method is applied to our recently developed Upside model, where the free energy for side chains is rapidly calculated at every time-step, allowing for a smooth energy landscape without steric rattling of the side chains. After this contrastive divergence training, the model is able to de novo fold proteins up to 100 residues on a single core in days. This improved Upside model provides a starting point both for investigation of folding dynamics and as an inexpensive Bayesian prior for protein physics that can be integrated with additional experimental or bioinformatic data.
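
The contrastive divergence idea mentioned in the abstract can be illustrated with a toy sketch. This is not the Upside model or the authors' training code; the five-state system, the `target` occupancies, and the function names `metropolis_steps` and `contrastive_divergence` are hypothetical choices made only for illustration. The sketch assumes one energy parameter per discrete state, starts short Metropolis runs from the observed ("native") states, and shifts the energies so that states over-populated by the simulation relative to the data are raised and under-populated states are lowered.

```python
import numpy as np

def metropolis_steps(states, theta, n_steps, rng):
    """Run a few Metropolis steps for many chains; the energy of state s is theta[s]."""
    states = states.copy()
    for _ in range(n_steps):
        # Symmetric proposal: pick any state uniformly at random.
        proposal = rng.integers(0, theta.size, size=states.size)
        delta_e = theta[proposal] - theta[states]
        # Downhill moves are always accepted; uphill moves with probability exp(-delta_e).
        accept = rng.random(states.size) < np.exp(-np.clip(delta_e, 0.0, None))
        states = np.where(accept, proposal, states)
    return states

def contrastive_divergence(data, n_states, n_iters=300, n_steps=5, lr=0.5, seed=0):
    """Fit per-state energies so that short simulations stop drifting away from the data."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(n_states)
    data_occupancy = np.bincount(data, minlength=n_states) / data.size
    for _ in range(n_iters):
        # Short trajectories that start from the data, not from equilibrium.
        model = metropolis_steps(data, theta, n_steps, rng)
        model_occupancy = np.bincount(model, minlength=n_states) / model.size
        # dE/dtheta[s] is the indicator of state s, so ensemble averages are occupancies.
        theta += lr * (model_occupancy - data_occupancy)
    return theta

# Hypothetical "native" data: state 2 dominates the observed ensemble.
target = np.array([0.05, 0.15, 0.60, 0.15, 0.05])
data = np.random.default_rng(1).choice(target.size, size=2000, p=target)

theta = contrastive_divergence(data, target.size)
boltzmann = np.exp(-theta) / np.exp(-theta).sum()
print("trained energies:", np.round(theta, 2))
print("Boltzmann occupancies:", np.round(boltzmann, 2))
```

In this toy setting the update stops changing the energies, on average, once trajectories started from the data no longer leave the data distribution, which is the sense in which the training tunes the Boltzmann ensemble rather than a single structure.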

Highlights

Since Anfinsen’s original demonstration that a protein’s sequence determines its structure, multiple computational strategies have been developed to predict a protein’s structure from its sequence.
Although all-atom, explicit-solvent methods have become successful for the folding of some small proteins, their ability to replicate the properties outside the native basin requires substantial improvement [4].
We demonstrate that we can achieve de novo folding for a diverse collection of proteins by combining our fast-equilibrating Upside model with a contrastive divergence procedure that optimizes the stability of the native well.

Summary

Introduction

Since Anfinsen’s original demonstration that a protein’s sequence determines its structure, multiple computational strategies have been developed to predict a protein’s structure from its sequence. An additional facet of this challenge is to replicate the energy landscape that defines both the folding process and other dynamical properties. In the absence of other information, coarse-grained models with one or a few beads per residue are too simplistic for de novo structure prediction. Cβ-level models that have authentic protein backbones with φ/ψ dihedral angles, but lack side chain rotamers, have achieved some success [1,2,3]. Although all-atom, explicit-solvent methods have become successful for the folding of some small proteins, their ability to replicate the properties outside the native basin requires substantial improvement [4]. It is unclear which representation provides the optimal combination of detail and computational expense to replicate protein folding and dynamics. Integral to the choice of representation is which interactions to include, such as hydrogen bonding, van der Waals interactions, and hydrophobic burial.

