DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations

Ludovica Montanucci,Yotam Frank,Piero Fariselli,Emidio Capriotti,Nir Ben-Tal

doi:10.1186/s12859-019-2923-1

Ludovica Montanucci, Yotam Frank + Show 3 more

Open Access

PDF Available

https://doi.org/10.1186/s12859-019-2923-1

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

BackgroundPredicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (∆∆G) between wild type and variant proteins, using sequence and structure information. Most of the available methods however do not exhibit the anti-symmetric prediction property, which guarantees that the predicted ∆∆G value for a variation is the exact opposite of that predicted for the reverse variation, i.e., ∆∆G(A → B) = −∆∆G(B → A), where A and B are amino acids.ResultsHere we introduce simple anti-symmetric features, based on evolutionary information, which are combined to define an untrained method, DDGun (DDG untrained). DDGun is a simple approach based on evolutionary information that predicts the ∆∆G for single and multiple variations from sequence and structure information (DDGun3D). Our method achieves remarkable performance without any training on the experimental datasets, reaching Pearson correlation coefficients between predicted and measured ∆∆G values of ~ 0.5 and ~ 0.4 for single and multiple site variations, respectively. Surprisingly, DDGun performances are comparable with those of state of the art methods. DDGun also naturally predicts multiple site variations, thereby defining a benchmark method for both single site and multiple site predictors. DDGun is anti-symmetric by construction predicting the value of the ∆∆G of a reciprocal variation as almost equal (depending on the sequence profile) to -∆∆G of the direct variation. This is a valuable property that is missing in the majority of the methods.ConclusionsEvolutionary information alone combined in an untrained method can achieve remarkably high performances in the prediction of ∆∆G upon protein mutation. Non-trained approaches like DDGun represent a valid benchmark both for scoring the predictive power of the individual features and for assessing the learning capability of supervised methods.

Highlights

Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function
The three following scores are based purely on sequence data: 1. the difference between the wild type and mutant residue in the Blosum62 substitution matrix; 2. the difference in the interaction energy between the wild-type and substituted residue with their sequence neighbours within a 2-residue window; 3. the difference in the hydrophobicity between wild type and mutant residues according to the Kyte-Doolittle scale
We need predictors that can be as good as predicting protein stability changes upon variations and at the same time obtaining opposite values for the reciprocal sequence changes that bring the mutated proteins back to their respective wild types

Summary

Introduction

Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (ΔΔG) between wild type and variant proteins, using sequence and structure information. Structure-based methods take advantage of the features representing the structural environment of the substituted residue. The combination of such features with physical and statistical potentials, improves the performance of the predictors [2]. In general, sequencebased predictors are less accurate than structure-based ones, some sequence-based methods, especially those exploiting evolutionary information, show comparable performances to structure-based tools [20]

Objectives

Methods

Results

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jul 1, 2019
Citations: 100	License type: open-access

R Discovery Prime

DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Author response: Rapid protein stability prediction using deep learning representations
Lasse M Blaabjerg ... Kristoffer E Johansson
-
Lasse M Blaabjerg, et. al.Lasse M Blaabjerg ... Kristoffer E Johansson
09 May 2023
09 May 2023

Prediction of protein stability changes for single‐site mutations using support vector machines
Jianlin Cheng ... Arlo Randall
Proteins: Structure, Function, and Bioinformatics | VOL. 62
Jianlin Cheng, et. al.Jianlin Cheng ... Arlo Randall
21 Dec 2005
Proteins: Structure, Function, and Bioinformatics | VOL. 62

DDGun: an untrained predictor of protein stability changes upon amino acid variants.
Ludovica Montanucci ... Silvia Benevenuta
Nucleic Acids Research | VOL. 50
Ludovica Montanucci, et. al.Ludovica Montanucci ... Silvia Benevenuta
07 May 2022
Nucleic Acids Research | VOL. 50

A Deep-Learning Sequence-Based Method to Predict Protein Stability Changes Upon Genetic Variations.
Corrado Pancotti ... Emidio Capriotti
Genes | VOL. 12
Corrado Pancotti, et. al.Corrado Pancotti ... Emidio Capriotti
12 Jun 2021
Genes | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Bioinformatics