Abstract

In structure determination by X-ray crystallography and solution NMR spectroscopy, experimental data are collected as time and ensemble-averages. Thus, in principle, appropriate time and ensemble-averaged models should be used. Refinement of an ensemble of conformers rather than one single structure against the experimental NMR data could, however, result in overfitting the data because of the significantly increased number of parameters. To avoid overfitting, complete cross-validation, which provides an unbiased measure of the fit, has been applied to nuclear Overhauser effect derived distance refinement. Using two synthetic test cases, a correlation was demonstrated between the cross-validated measure to the fit (defined in terms of root-mean-square deviations from the distance restraints and number of violations) and the number of models that best reproduce the conformational variability in solution. A new method, based on a probability map, has been used to generate good representations of the resulting ensembles of structures. The method has also been applied to observed NMR data for two proteins, interleukin 4 and interleukin 8. For interleukin 4, cross-validation indicates that a single-conformer model gives the most accurate representation of the structure, whereas conventional measures of fit between the experimental data and those calculated from the model decrease when increasing the number of conformers, indicating overfitting. For interleukin 8, complete cross-validation predicts a twin-conformer model to be the most faithful representation of the experimental data. Two distinct conformations for the loop formed by residues 16 to 22 emerge from the family of twin-conformer structures. The putative alternate conformation of the loop is not observed in the crystal structure of interleukin 8. However, because of crystal packing contacts in this region this does not necessarily exclude the presence of the alternate conformation in solution. The twin-conformer model is supported by observed chemical exchange line broadening for the amide of His 18 obtained by15N relaxation studies. This region has also been implied to be involved in receptor binding.f2f2Abbreviations used: NOE, nuclear Overhauser effect; DG, distance geometry; SA, simulated annealing; MD, molecular dynamics; r. m. s. d., root-mean-squared deviation; IL-4, interleukin 4; IL-8, interleukin 8; PDB, Protein Data Bank.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call