Toward Developing Intuitive Rules for Protein Variant Effect Prediction Using Deep Mutational Scanning Data.

Cheloor Kovilakam Sruthi,Meher K Prakash,Hemalatha Balaram

doi:10.1021/acsomega.0c02402

Cheloor Kovilakam Sruthi, Meher K Prakash + Show 1 more

Open Access

https://doi.org/10.1021/acsomega.0c02402

Copy DOI

Abstract

Protein structure and function can be severely altered by even a single amino acid mutation. Predictions of mutational effects using extensive artificial intelligence (AI)-based models, although accurate, remain as enigmatic as the experimental observations in terms of improving intuitions about the contributions of various factors. Inspired by Lipinski’s rules for drug-likeness, we devise simple thresholding criteria on five different descriptors such as conservation, which have so far been limited to qualitative interpretations such as high conservation implies high mutational effect. We analyze systematic deep mutational scanning data of all possible single amino acid substitutions on seven proteins (25153 mutations) to first define these thresholds and then to evaluate the scope and limits of the predictions. At this stage, the approach allows us to comment easily and with a low error rate on the subset of mutations classified as neutral or deleterious by all of the descriptors. We hope that complementary to the accurate AI predictions, these thresholding rules or their subsequent modifications will serve the purpose of codifying the knowledge about the effects of mutations.

Full Text