Towards sequence-based prediction of mutation-induced stability changes in unseen non-homologous proteins

Lukas Folkman,Bela Stantic,Abdul Sattar

doi:10.1186/1471-2164-15-s1-s4

Lukas Folkman, Bela Stantic + Show 1 more

Open Access

https://doi.org/10.1186/1471-2164-15-s1-s4

Copy DOI

Journal: BMC Genomics	Publication Date: Jan 1, 2014
Citations: 44	License type: cc-by

Affiliation: Griffith University, Data61

Abstract

BackgroundReliable prediction of stability changes induced by a single amino acid substitution is an important aspect of computational protein design. Several machine learning methods capable of predicting stability changes from the protein sequence alone have been introduced. Prediction performance of these methods is evaluated on mutations unseen during training. Nevertheless, different mutations of the same protein, and even the same residue, as encountered during training are commonly used for evaluation. We argue that a faithful evaluation can be achieved only when a method is tested on previously unseen proteins with low sequence similarity to the training set.ResultsWe provided experimental evidence of the limitations of the evaluation commonly used for assessing the prediction performance. Furthermore, we demonstrated that the prediction of stability changes in previously unseen non-homologous proteins is a challenging task for currently available methods. To improve the prediction performance of our previously proposed method, we identified features which led to over-fitting and further extended the model with new features. The new method employs Evolutionary And Structural Encodings with Amino Acid parameters (EASE-AA). Evaluated with an independent test set of more than 600 mutations, EASE-AA yielded a Matthews correlation coefficient of 0.36 and was able to classify correctly 66% of the stabilising and 74% of the destabilising mutations. For real-value prediction, EASE-AA achieved the correlation of predicted and experimentally measured stability changes of 0.51.ConclusionsCommonly adopted evaluation with mutations in the same protein, and even the same residue, randomly divided between the training and test sets lead to an overestimation of prediction performance. Therefore, stability changes prediction methods should be evaluated only on mutations in previously unseen non-homologous proteins. Under such an evaluation, EASE-AA predicts stability changes more reliably than currently available methods.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-S1-S4) contains supplementary material, which is available to authorized users.

Highlights

Reliable prediction of stability changes induced by a single amino acid substitution is an important aspect of computational protein design
We compared the prediction performance of the two methods from the literature, I-Mutant2.0 [9] and MUpro [10], our previously proposed method [15], and the method designed in this study (EASEAA)
Predictive features and the improvements yielded by EASE-AA We found that EASE-AA consistently outperformed our previous work (EASE) when predicting mutations in unseen proteins

Summary

Introduction

Reliable prediction of stability changes induced by a single amino acid substitution is an important aspect of computational protein design. Several machine learning methods capable of predicting stability changes from the protein sequence alone have been introduced. Prediction performance of these methods is evaluated on mutations unseen during training. As more experimental data about stability changes became available in the ProTherm database [2], machine learning methods for predicting stability changes emerged. They can be categorised as structure-based and sequence-based methods. We focused our attention on the sequence-based methods

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards sequence-based prediction of mutation-induced stability changes in unseen non-homologous proteins

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

Sequence-only evolutionary and predicted structural features for the prediction of stability changes in protein mutants
Lukas Folkman ... Abdul Sattar
BMC bioinformatics | VOL. 14
Lukas Folkman, et. al.Lukas Folkman ... Abdul Sattar
01 Jan 2013
BMC bioinformatics | VOL. 14

Author response: Rapid protein stability prediction using deep learning representations
Lasse M Blaabjerg ... Lydia L Good
-
Lasse M Blaabjerg, et. al.Lasse M Blaabjerg ... Lydia L Good
09 May 2023
09 May 2023

PMSPcnn: Predicting protein stability changes upon single point mutations with convolutional neural network
Xiaohan Sun ... Jingjie Su
Structure (London, England : 1993) | VOL. 32
Xiaohan Sun, et. al.Xiaohan Sun ... Jingjie Su
01 Mar 2024
Structure (London, England : 1993) | VOL. 32

Transfer learning to leverage larger datasets for improved prediction of protein stability changes
Henry Dieckhaus ... Brian Kuhlman
Proceedings of the National Academy of Sciences of the United States of America | VOL. 121
Henry Dieckhaus, et. al.Henry Dieckhaus ... Brian Kuhlman
29 Jan 2024
Proceedings of the National Academy of Sciences of the United States of America | VOL. 121

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards sequence-based prediction of mutation-induced stability changes in unseen non-homologous proteins

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics