Abstract

NMR studies can provide unique information about protein conformations in solution. In CASP14, three reference structures provided by solution NMR methods were available (T1027, T1029, and T1055), as well as a fourth data set of NMR‐derived contacts for an integral membrane protein (T1088). For the three targets with NMR‐based structures, the best prediction results ranged from very good (GDT_TS = 0.90, for T1055) to poor (GDT_TS = 0.47, for T1029). We explored the basis of these results by comparing all CASP14 prediction models against experimental NMR data. For T1027, NMR data reveal extensive internal dynamics, presenting a unique challenge for protein structure prediction methods. The analysis of T1029 motivated exploration of a novel method of “inverse structure determination,” in which an AlphaFold2 model was used to guide NMR data analysis. NMR data provided to CASP predictor groups for target T1088, a 238‐residue integral membrane porin, was also used to assess several NMR‐assisted prediction methods. Most groups involved in this exercise generated similar beta‐barrel models, with good agreement with the experimental data. However, as was also observed in CASP13, some pure prediction groups that did not use any NMR data generated models for T1088 that better fit the NMR data than the models generated using these experimental data. These results demonstrate the remarkable power of modern methods to predict structures of proteins with accuracies rivaling solution NMR structures, and that it is now possible to reliably use prediction models to guide and complement experimental NMR data analysis.

Highlights

  • The remarkable performance of some protein structure prediction groups in the 2020 Critical Assessment of Protein Structure Prediction experiment 14 (CASP14) has set a new standard for protein structure modeling.[1]

  • In the previous 2018 CASP13 experiment, we explored the concept of using incomplete “sparse” solution NMR data to assist protein structure prediction methods.[13]

  • Models were compared with backbone chemical shift data using the TALOS_N program,[16] and residual dipolar coupling (RDC) data where available. These results demonstrate the remarkable accuracy of some CASP14 prediction models, AlphaFold[2], and reveal different reasons for the differences between experimental and prediction models for each target for which the reference struture was determined by NMR methods

Read more

Summary

| INTRODUCTION

The remarkable performance of some protein structure prediction groups in the 2020 Critical Assessment of Protein Structure Prediction experiment 14 (CASP14) has set a new standard for protein structure modeling.[1]. We explored the basis of these results by comparing 1H–1H distance maps derived from these models against the experimental NOESY peak lists using recall and precision scores (RPF-DP scores).[14,15] Models were compared with backbone chemical shift data using the TALOS_N program,[16] and RDC data where available These results demonstrate the remarkable accuracy of some CASP14 prediction models, AlphaFold[2], and reveal different reasons for the differences between experimental and prediction models for each target for which the reference struture was determined by NMR methods

| METHODS
| RESULTS
Findings
| DISCUSSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.