Quantification of biases in predictions of protein-protein binding affinity changes upon mutations.

Matsvei Tsishyn, Marianne Rooman, Fabrizio Pucci

doi:10.1093/bib/bbad491

Abstract

Understanding the impact of mutations on protein-protein binding affinity is a key objective for a wide range of biotechnological applications and for shedding light on disease-causing mutations, which are often located at protein-protein interfaces. Over the past decade, many computational methods using physics-based and/or machine learning approaches have been developed to predict how protein binding affinity changes upon mutations. They all claim to achieve astonishing accuracy on both training and test sets, with performances on standard benchmarks such as SKEMPI 2.0 that seem overly optimistic. Here we benchmarked eight well-known and widely used predictors and identified their biases and dataset dependencies, using not only SKEMPI 2.0 as a test set but also deep mutagenesis data on the severe acute respiratory syndrome coronavirus 2 spike protein in complex with the human angiotensin-converting enzyme 2. We showed that, even though most of the tested methods reach a significant degree of robustness and accuracy, they suffer from limited generalizability and struggle to predict unseen mutations. Interestingly, the generalizability problems are more severe for pure machine learning approaches, while physics-based methods are less affected by this issue. Moreover, undesirable prediction biases toward specific mutation properties, the most marked being toward destabilizing mutations, are also observed and should be carefully considered by method developers. We conclude from our analyses that there is room for improvement in the prediction models and suggest ways to check, assess and improve their generalizability and robustness.

Journal: Briefings in Bioinformatics
Publication Date: Nov 22, 2023
Citations: 3
License type: CC BY 4.0

Similar Papers

Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2.
Wenhui Li ... Wayne A Marasco
The EMBO Journal | VOL. 24
24 Mar 2005

Mechanisms of Host Receptor Adaptation by Severe Acute Respiratory Syndrome Coronavirus
Kailang Wu ... Fang Li
Journal of Biological Chemistry | VOL. 287
01 Mar 2012

PANDA: Predicting the change in proteins binding affinity upon mutations by finding a signal in primary structures.
Wajid Arshad Abbasi ... Syed Ali Abbas
Journal of Bioinformatics and Computational Biology | VOL. 19
11 Jun 2021

Computational prediction of the effect of amino acid changes on the binding affinity between SARS-CoV-2 spike RBD and human ACE2
Chen Chen ... Costas D Maranas
Proceedings of the National Academy of Sciences | VOL. 118
29 Sep 2021
