Probabilistic sampling of protein conformations: New hope for brute force?

Howard J. Feldman, Christopher W.V. Hogue

doi:10.1002/prot.1163

Abstract

Protein structure prediction from sequence alone by "brute force" random methods is computationally expensive. Estimates have suggested that it could take all the computers in the world longer than the age of the universe to compute the structure of a single 200-residue protein. Here we investigate the use of a faster version of our FOLDTRAJ probabilistic all-atom protein-structure-sampling algorithm. We have improved the method so that it is now over twenty times faster than originally reported and capable of rapidly sampling conformational space without lattices. It uses geometrical constraints and a Lennard-Jones type potential for self-avoidance. We have also implemented a novel method that adds secondary structure prediction information, so that sampled structures contain protein-like amounts of secondary structure. In a generated set of 100,000 probabilistic conformers of 1VII, 1ENH, and 1PMC, the structures with the smallest Cα RMSD from native are 3.95, 5.12, and 5.95 Å, respectively. Expanding this test to a set of 17 distinct protein folds, we find that all-helical structures are "hit" by brute force more frequently than beta or mixed structures. For small helical proteins or very small non-helical ones, this approach should produce a "hit" close enough to detect with a good scoring function in a pool of several million conformers. By fitting the distribution of RMSDs from the native state for each of the 17 sets of conformers to the extreme value distribution, we are able to estimate the size of conformational space for each. With a 0.5 Å RMSD cutoff, the number of conformers is roughly 2^N, where N is the number of residues in the protein. This is smaller than previous estimates, indicating an average of only two possible conformations per residue when sterics are accounted for.
Our method reduces the effective number of conformations available at each residue by probabilistic bias, without requiring any particular discretization of residue conformational space, and is the fastest method of its kind. With computer speeds doubling every 18 months and parallel and distributed computing becoming more practical, the brute force approach to protein structure prediction may yet have some hope in the near future.
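The extreme value fit described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a Gumbel (minimum) form of the extreme value distribution, fits it by the method of moments, and reads the "size of conformational space" as the reciprocal of the fitted probability that a random conformer falls within the RMSD cutoff. The function names, the synthetic sample, and the parameter values are all hypothetical.

```python
import math
import random
import statistics

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def sample_gumbel_min(rng, mu, beta):
    """Draw one synthetic 'RMSD' from a Gumbel (minimum) distribution.

    Inverse-CDF sampling: x = mu + beta * ln(E), with E ~ Exponential(1).
    """
    return mu + beta * math.log(rng.expovariate(1.0))


def fit_gumbel_min(rmsds):
    """Method-of-moments fit (an assumption, not the paper's fitting method).

    For the Gumbel (minimum) distribution:
        mean = mu - gamma * beta,  variance = (pi * beta)**2 / 6
    """
    mean = statistics.fmean(rmsds)
    std = statistics.stdev(rmsds)
    beta = std * math.sqrt(6.0) / math.pi
    mu = mean + EULER_GAMMA * beta
    return mu, beta


def prob_below(cutoff, mu, beta):
    """CDF of the Gumbel (minimum) distribution: 1 - exp(-exp((x - mu) / beta))."""
    z = (cutoff - mu) / beta
    return -math.expm1(-math.exp(z))  # expm1 keeps precision for tiny tails


def log2_space_size(cutoff, mu, beta):
    """log2 of 1 / P(RMSD < cutoff), comparable to N if the space is ~2^N."""
    return -math.log2(prob_below(cutoff, mu, beta))


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic data only; mu = 12.0 and beta = 1.5 are made-up values.
    rmsds = [sample_gumbel_min(rng, 12.0, 1.5) for _ in range(100_000)]
    mu_hat, beta_hat = fit_gumbel_min(rmsds)
    print("fitted mu, beta:", mu_hat, beta_hat)
    print("log2(estimated conformers) at 0.5 A cutoff:",
          log2_space_size(0.5, mu_hat, beta_hat))
```

Under this reading, a protein whose conformational space is roughly 2^N would give a `log2_space_size` value close to its residue count N at the 0.5 Å cutoff. The synthetic numbers above are not meant to reproduce any result from the paper.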

Journal: Proteins: Structure, Function, and Bioinformatics
Publication Date: Nov 15, 2001
Citations: 108

Similar Papers

Balancing multiple objectives in conformation sampling to control decoy diversity in template-free protein structure prediction
Ahmed Bin Zaman ... Amarda Shehu
BMC Bioinformatics | VOL. 20
25 Apr 2019

A Hybrid Scheme to Solve the Protein Structure Prediction Problem
José C Calvo ... Julio Ortega
01 Jan 2009

Protein structure prediction with the UNRES force‐field using Replica‐Exchange Monte Carlo‐with‐Minimization; Comparison with MCM, CSA, and CFMC
Marian Nanias ... Harold A Scheraga
Journal of Computational Chemistry | VOL. 26
08 Aug 2005

Space Partitioning for Scalable K-Means
David Pettinger ... Giuseppe Di Fatta
01 Dec 2010
