A critical analysis of computational protein design with sparse residue interaction graphs.

Swati Jain,Bruce R Donald,Ivelin S Georgiev,Jonathan D Jou

doi:10.1371/journal.pcbi.1005346

Swati Jain, Bruce R Donald + Show 2 more

Open Access

https://doi.org/10.1371/journal.pcbi.1005346

Copy DOI

Journal: PLOS Computational Biology	Publication Date: Mar 30, 2017
Citations: 2	License type: CC BY 4.0

Affiliation: Duke University, Duke Medical Center

Abstract

Protein design algorithms enumerate a combinatorial number of candidate structures to compute the Global Minimum Energy Conformation (GMEC). To efficiently find the GMEC, protein design algorithms must methodically reduce the conformational search space. By applying distance and energy cutoffs, the protein system to be designed can thus be represented using a sparse residue interaction graph, where the number of interacting residue pairs is less than all pairs of mutable residues, and the corresponding GMEC is called the sparse GMEC. However, ignoring some pairwise residue interactions can lead to a change in the energy, conformation, or sequence of the sparse GMEC vs. the original or the full GMEC. Despite the widespread use of sparse residue interaction graphs in protein design, the above mentioned effects of their use have not been previously analyzed. To analyze the costs and benefits of designing with sparse residue interaction graphs, we computed the GMECs for 136 different protein design problems both with and without distance and energy cutoffs, and compared their energies, conformations, and sequences. Our analysis shows that the differences between the GMECs depend critically on whether or not the design includes core, boundary, or surface residues. Moreover, neglecting long-range interactions can alter local interactions and introduce large sequence differences, both of which can result in significant structural and functional changes. Designs on proteins with experimentally measured thermostability show it is beneficial to compute both the full and the sparse GMEC accurately and efficiently. To this end, we show that a provable, ensemble-based algorithm can efficiently compute both GMECs by enumerating a small number of conformations, usually fewer than 1000. This provides a novel way to combine sparse residue interaction graphs with provable, ensemble-based algorithms to reap the benefits of sparse residue interaction graphs while avoiding their potential inaccuracies.

Highlights

Computational structure-based protein design is an emerging field with many applications in basic science and biomedical research [1]
Our results show that commonly used distance cutoffs can return a Global Minimum Energy Conformation (GMEC) whose sequence is different than that of the GMEC returned without those cutoffs
In the examples where the full GMEC predicts mutations that correlate with the thermostable mutant but the sparse GMEC does not, key high-energy, long-range pairwise interactions are consistently omitted from the sparse residue interaction graph and this failure to account for long-range interactions changes the sequence of the sparse GMEC

Summary

Introduction

Computational structure-based protein design is an emerging field with many applications in basic science and biomedical research [1]. Protein sequences have been designed to fold to specific tertiary structures [2,3,4,5]. Antibodies and nanobodies have been developed for therapeutic purposes by designing protein-protein and protein-ligand interfaces [14,15,16,17,18,19]. Negative design has been used for predicting resistance mutations for highly drug-resistant pathogens [20, 21]. The above mentioned studies are examples of the predictive power of computational protein design algorithms

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A critical analysis of computational protein design with sparse residue interaction graphs.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Rotamer optimization for protein design through MAP estimation and problem‐size reduction
Eun‐Jong Hong ... Shaun M Lippow
Journal of Computational Chemistry | VOL. 30
Eun‐Jong Hong, et. al.Eun‐Jong Hong ... Shaun M Lippow
02 Jan 2009
Journal of Computational Chemistry | VOL. 30

Computational Protein Design Using AND/OR Branch-and-Bound Search
Yichao Zhou ... Jianyang Zeng
-
Yichao Zhou, et. al.Yichao Zhou ... Jianyang Zeng
01 Jan 2015
01 Jan 2015

Computational Protein Design Using AND/OR Branch-and-Bound Search.
Yichao Zhou ... Yuexin Wu
Journal of computational biology : a journal of computational molecular cell biology | VOL. 23
Yichao Zhou, et. al.Yichao Zhou ... Yuexin Wu
11 May 2016
Journal of computational biology : a journal of computational molecular cell biology | VOL. 23

Comparing three stochastic search algorithms for computational protein design: Monte Carlo, replica exchange Monte Carlo, and a multistart, steepest-descent heuristic.
David Mignon ... Thomas Simonson
Journal of computational chemistry | VOL. 37
David Mignon, et. al.David Mignon ... Thomas Simonson
20 May 2016
Journal of computational chemistry | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A critical analysis of computational protein design with sparse residue interaction graphs.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology