An increasing variety of experimental techniques for protein structure determination rely on the site-specific incorporation of probes. Examples include NMR using the paramagnetic relaxation enhancement (PRE) effect, fluorescence using FRET and related techniques, and site-directed chemical cross-linking quantified by mass-spectrometry. In each case, the probe locations must be carefully chosen, which poses a major challenge, as some probe locations are highly informative, whereas others yield little useful information. In the absence of structural knowledge, how does one efficiently choose probe locations? We present a method called Rational Probe Mutagenesis (RPM) that relies on iteration between computation and experiment to rapidly determine protein structures. RPM is based on the idea of reducing the “width” of the conformational ensemble as rapidly as possible. An ensemble is generated using whatever data is currently available. This ensemble is then analyzed to find the experiment that maximally discriminates between possible structures. The resulting experimental data is then used to generate a new updated ensemble and the process repeated. We demonstrate RPM on several simple model systems and a challenging real-world solid-state NMR dataset, where we show that RPM can choose highly informative experiments, leading to potential time and cost savings by avoiding redundant or uninformative experiments.
Read full abstract