Abstract

Protein conformational changes are known to play important roles in assorted biochemical and biological processes. Driven by the thermal motions of surrounding solvent molecules, such structural remodeling often occurs stochastically. Yet, regardless of how random the conformational reconfiguration may appear, it could in principle be described by a linear combination of a set of orthogonal modes which, in turn, are contained in the intramolecular distance distributions. The central challenge is how to obtain the distribution. This contribution proposes a Bayesian data-augmentation scheme to extract the predominant modes from only a few distance distributions, be they from computational sampling or directly from experiments such as single-molecule Förster-type resonance energy transfer (smFRET). The inference of the complete protein structure from insufficient data was recognized as isomorphic to the missing-data problem in Bayesian statistical learning. Using smFRET data as an example, the missing coordinates were deduced, given protein structural constraints and a limited number of smFRET distances; the Boltzmann weighting of each inferred protein structure was then evaluated using computational modeling to numerically construct the posterior density for the global protein conformation. The conformational modes were then determined from the iteratively converged overall conformational distribution using principal component analysis. Two examples were presented to illustrate these basic ideas as well as their practical implementation. The scheme described herein was based on the theory behind the powerful Tanner-Wong algorithm, which guarantees convergence to the true posterior density.
However, instead of assuming a mathematical model to calculate the likelihood as in conventional statistical inference, here the protein structure was treated as a statistical parameter and was imputed from a numerical likelihood function based on structural information, so that the method does not rely on an assumed probability model. The framework put forth here is anticipated to be generally applicable, offering a new way to articulate protein conformational changes in a quantifiable manner.
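
The last two steps of the scheme, Boltzmann weighting of candidate structures followed by principal component analysis, can be illustrated with a minimal sketch. It is not the authors' implementation: the toy three-coordinate "structures", the harmonic energy, and the function names `boltzmann_weights` and `weighted_pca` are all assumptions made for illustration, and the data-augmentation (imputation) loop itself is omitted.

```python
import numpy as np


def boltzmann_weights(energies, kT=1.0):
    """Normalized Boltzmann weights exp(-E/kT)/Z, shifted for numerical stability."""
    e = np.asarray(energies, dtype=float)
    w = np.exp(-(e - e.min()) / kT)
    return w / w.sum()


def weighted_pca(coords, weights):
    """Principal modes of a weighted ensemble.

    coords  : (n_structures, n_coordinates) array
    weights : (n_structures,) array summing to 1
    Returns eigenvalues in descending order and the modes as columns.
    """
    x = np.asarray(coords, dtype=float)
    w = np.asarray(weights, dtype=float)
    mean = w @ x                                  # weighted mean structure
    centered = x - mean
    cov = (centered * w[:, None]).T @ centered    # weighted covariance matrix
    evals, evecs = np.linalg.eigh(cov)            # eigh returns ascending order
    order = np.argsort(evals)[::-1]
    return evals[order], evecs[:, order]


# Toy ensemble (hypothetical): candidate structures spread uniformly along one
# collective mode, plus small isotropic noise in all coordinates.
rng = np.random.default_rng(0)
true_mode = np.array([1.0, 1.0, 0.0]) / np.sqrt(2.0)
amplitudes = rng.uniform(-6.0, 6.0, size=500)
coords = amplitudes[:, None] * true_mode + rng.normal(0.0, 0.1, size=(500, 3))

# Assumed harmonic energy along the mode, in units of kT.
energies = amplitudes**2 / 8.0
weights = boltzmann_weights(energies)

evals, modes = weighted_pca(coords, weights)
overlap = abs(modes[:, 0] @ true_mode)  # close to 1 if the mode is recovered
```

In this toy setting the leading eigenvector should align closely with `true_mode`. In the scheme described above, the candidate structures would instead come from the imputation step constrained by the smFRET distances, and the energies from computational modeling.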
