Abstract

An important goal in molecular biology is to understand functional changes upon single-point mutations in proteins. Doing so through a detailed characterization of structure spaces and underlying energy landscapes is desirable but continues to challenge methods based on Molecular Dynamics. In this paper we propose a novel algorithm, SIfTER, which is based instead on stochastic optimization to circumvent the computational challenge of exploring the breadth of a protein’s structure space. SIfTER is a data-driven evolutionary algorithm, leveraging experimentally-available structures of wildtype and variant sequences of a protein to define a reduced search space from where to efficiently draw samples corresponding to novel structures not directly observed in the wet laboratory. The main advantage of SIfTER is its ability to rapidly generate conformational ensembles, thus allowing mapping and juxtaposing landscapes of variant sequences and relating observed differences to functional changes. We apply SIfTER to variant sequences of the H-Ras catalytic domain, due to the prominent role of the Ras protein in signaling pathways that control cell proliferation, its well-studied conformational switching, and abundance of documented mutations in several human tumors. Many Ras mutations are oncogenic, but detailed energy landscapes have not been reported until now. Analysis of SIfTER-computed energy landscapes for the wildtype and two oncogenic variants, G12V and Q61L, suggests that these mutations cause constitutive activation through two different mechanisms. G12V directly affects binding specificity while leaving the energy landscape largely unchanged, whereas Q61L has pronounced, starker effects on the landscape. An implementation of SIfTER is made available at http://www.cs.gmu.edu/~ashehu/?q=OurTools. We believe SIfTER is useful to the community to answer the question of how sequence mutations affect the function of a protein, when there is an abundance of experimental structures that can be exploited to reconstruct an energy landscape that would be computationally impractical to do via Molecular Dynamics.

Highlights

  • Mutations in protein sequences that lead to altered functions have been found to drive or participate in many human diseases [1, 2]

  • Despite significant investigation in silico via methods based on Molecular Dynamics, details are missing on how mutations affect the ability of Ras to access the states it needs to perform its biological activity

  • We provide the algorithm for the research community to allow further investigation of the open question on how mutations to the sequence of a protein affect biological activity

Read more

Summary

Introduction

Mutations in protein sequences that lead to altered functions have been found to drive or participate in many human diseases [1, 2]. An important goal of molecular biology is to understand functional changes upon single-point mutations in proteins. This is a challenging task for both wet and dry laboratories. In this paper we propose a novel conformational search algorithm, which is based on stochastic optimization rather than MD to circumvent the computational challenge of exploring the breadth of a protein’s structure space. We refer to this algorithm as SIfTER for Structure Initiated Search for Transient Energy Regions. Before relating further details on the novel algorithmic components that make this possible, we justify SIfTER in a gradual and systematic way on a hallmark case study in molecular biology, the family of Ras proteins

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call