Variant effect predictions capture some aspects of deep mutational scanning experiments

Jonas Reeb,Burkhard Rost,Theresa Wirth

doi:10.1186/s12859-020-3439-4

Abstract

BackgroundDeep mutational scanning (DMS) studies exploit the mutational landscape of sequence variation by systematically and comprehensively assaying the effect of single amino acid variants (SAVs; also referred to as missense mutations, or non-synonymous Single Nucleotide Variants – missense SNVs or nsSNVs) for particular proteins. We assembled SAV annotations from 22 different DMS experiments and normalized the effect scores to evaluate variant effect prediction methods. Three trained on traditional variant effect data (PolyPhen-2, SIFT, SNAP2), a regression method optimized on DMS data (Envision), and a naïve prediction using conservation information from homologs.ResultsOn a set of 32,981 SAVs, all methods captured some aspects of the experimental effect scores, albeit not the same. Traditional methods such as SNAP2 correlated slightly more with measurements and better classified binary states (effect or neutral). Envision appeared to better estimate the precise degree of effect. Most surprising was that the simple naïve conservation approach using PSI-BLAST in many cases outperformed other methods. All methods captured beneficial effects (gain-of-function) significantly worse than deleterious (loss-of-function). For the few proteins with multiple independent experimental measurements, experiments differed substantially, but agreed more with each other than with predictions.ConclusionsDMS provides a new powerful experimental means of understanding the dynamics of the protein sequence space. As always, promising new beginnings have to overcome challenges. While our results demonstrated that DMS will be crucial to improve variant effect prediction methods, data diversity hindered simplification and generalization.

Highlights

Deep mutational scanning (DMS) studies exploit the mutational landscape of sequence variation by systematically and comprehensively assaying the effect of single amino acid variants (SAVs; referred to as missense mutations, or non-synonymous Single Nucleotide Variants – missense SNVs or nsSNVs) for particular proteins
Through the increased efficiency of sequencing, a procedure formerly used primarily in silico [11, 12] has become feasible for experiments, namely assessing the effect of all possible SAVs in a protein, i.e. all possible amino acid mutations
DMS studies not complete yet Our Deep Mutational Scanning (DMS) analyses began with 22 separate experimental datasets from 18 unique proteins, since some experiments were performed on the same protein (Supplementary Online Material (SOM), Fig. S1a, Table S1) [29, 30, 32, 50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65]

Summary

Introduction

Deep mutational scanning (DMS) studies exploit the mutational landscape of sequence variation by systematically and comprehensively assaying the effect of single amino acid variants (SAVs; referred to as missense mutations, or non-synonymous Single Nucleotide Variants – missense SNVs or nsSNVs) for particular proteins. Through the increased efficiency of sequencing, a procedure formerly used primarily in silico [11, 12] has become feasible for experiments, namely assessing the effect of all possible SAVs in a protein, i.e. all possible amino acid mutations. In such deep mutational scanning (DMS) studies [13, 14], a sequence library with all possible variants is subjected to selection. DMS aims at measuring the landscape of functional fitness for select proteins [18]

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Mar 17, 2020
Citations: 32	License type: open-access

R Discovery Prime

R Discovery Prime

Variant effect predictions capture some aspects of deep mutational scanning experiments

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Proteogenomic Investigation of Strain Variation in Clinical Mycobacterium tuberculosis Isolates.
Tiaan Heunis ... Robin M Warren
Journal of Proteome Research | VOL. 16
Tiaan Heunis, et. al.Tiaan Heunis ... Robin M Warren
11 Sep 2017
Journal of Proteome Research | VOL. 16

SAAVpedia: Identification, Functional Annotation, and Retrieval of Single Amino Acid Variants for Proteogenomic Interpretation.
Soo Youn Lee ... Jong Shin Yoo
Journal of proteome research | VOL. 18
Soo Youn Lee, et. al.Soo Youn Lee ... Jong Shin Yoo
15 Oct 2019
Journal of proteome research | VOL. 18

Genetic evidence for predisposition to acute leukemias due to a missense mutation (p.Ser518Arg) in ZAP70 kinase: a case-control study
Khalil Khashei Varnamkhasti ... Sirous Naeimi
BMC Medical Genomics | VOL. 17
Khalil Khashei Varnamkhasti, et. al.Khalil Khashei Varnamkhasti ... Sirous Naeimi
07 Aug 2024
BMC Medical Genomics | VOL. 17

Exhaustive non-synonymous variants functionality prediction enables high resolution characterization of the neurofibromin architecture.
Ofer Isakov ... Shay Ben-Shachar
EBioMedicine | VOL. 36
Ofer Isakov, et. al.Ofer Isakov ... Shay Ben-Shachar
28 Sep 2018
EBioMedicine | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Variant effect predictions capture some aspects of deep mutational scanning experiments

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics