Abstract

The success rate of molecular replacement (MR) falls considerably when search models share less than 35% sequence identity with their templates, but can be improved significantly by using fold-recognition methods combined with exhaustive MR searches. Models based on alignments calculated with fold-recognition algorithms are more accurate than models based on conventional alignment methods such as FASTA or BLAST, which are still widely used for MR. In addition, by designing MR pipelines that integrate phasing and automated refinement and allow parallel processing of such calculations, one can effectively increase the success rate of MR. Here, updated results from the JCSG MR pipeline are presented, which to date has solved 33 MR structures with less than 35% sequence identity to the closest homologue of known structure. By using difficult MR problems as examples, it is demonstrated that successful MR phasing is possible even in cases where the similarity between the model and the template can only be detected with fold-recognition algorithms. In the first step, several search models are built based on all homologues found in the PDB by fold-recognition algorithms. The models resulting from this process are used in parallel MR searches with different combinations of input parameters of the MR phasing algorithm. The putative solutions are subjected to rigid-body and restrained crystallographic refinement and ranked based on the final values of free R factor, figure of merit and deviations from ideal geometry. Finally, crystal packing and electron-density maps are checked to identify the correct solution. If this procedure does not yield a solution with interpretable electron-density maps, then even more alternative models are prepared. The structurally variable regions of a protein family are identified based on alignments of sequences and known structures from that family and appropriate trimmings of the models are proposed. All combinations of these trimmings are applied to the search models and the resulting set of models is used in the MR pipeline. It is estimated that with the improvements in model building and exhaustive parallel searches with existing phasing algorithms, MR can be successful for more than 50% of recognizable homologues of known structures below the threshold of 35% sequence identity. This implies that about one-third of the proteins in a typical bacterial proteome are potential MR targets.

Highlights

  • Molecular replacement (MR; Rossmann, 2001) has an advantage over experimental phasing techniques because it requires only one data set of reflections obtained from a native protein crystal, which is considerably less resource-intensive doi:10.1107/S0907444907050111 133 research papers than multiple-wavelength experiments with substituted protein crystals.Because of advances in structural biology, more and more structures are available through the Protein Data Bank (PDB; Berman et al, 2000)

  • The success rate of molecular replacement (MR) falls considerably when search models share less than 35% sequence identity with their templates, but can be improved significantly by using fold-recognition methods combined with exhaustive MR searches

  • We provide a short description of the JCSG MR pipeline, discuss the advantages of using sensitive fold-recognition algorithms and show the benefits of applying parameter-space screening to MR searches

Read more

Summary

Introduction

Molecular replacement (MR; Rossmann, 2001) has an advantage over experimental phasing techniques because it requires only one data set of reflections obtained from a native protein crystal, which is considerably less resource-intensive doi:10.1107/S0907444907050111 133 research papers than multiple-wavelength experiments with substituted protein crystals. Several automated computational algorithms for solving this problem are available in popular programs such as Phaser (Storoni et al, 2004), AMoRe (Navaza, 2001), X-PLOR/CNS (Brunger et al, 1998), MOLREP (Vagin & Teplyakov, 2000), EPMR (Kissinger et al, 1999) and Queen of Spades (Glykos & Kokkinidis, 2000) The success of these MR methods depends critically on the quality of the model used and different ways of preparing models are still being explored. We demonstrated that it is possible to extend the limits of the MR method by using several designed protein models based on profile–profile fold recognition and exhaustive MR searches in a parallelized and automated MR pipeline (Schwarzenbacher et al, 2004) built at the Joint Center for Structural Genomics (Lesley et al, 2002).

The JCSG MR pipeline and its results
Parameter-space screening in MR searches
28 X X X 1o54
X 1o5i
Combinatorial trimming of search models
Findings
Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call