Abstract

The appearance at the end of 2019 of the new SARS-CoV-2 coronavirus led to an unprecedented response by the structural biology community, resulting in the rapid determination of many hundreds of structures of proteins encoded by the virus. As part of an effort to analyze and, if necessary, remediate these structures as deposited in the Protein Data Bank (PDB), this work presents a detailed analysis of 81 crystal structures of the main protease 3CLpro, an important target for the design of drugs against COVID-19. The structures of the unliganded enzyme and its complexes with a number of inhibitors were determined by multiple research groups using different experimental approaches and conditions; the resulting structures span 13 different polymorphs representing seven space groups. The structures of the enzyme itself, all determined by molecular replacement, are highly similar, with the exception of one polymorph with a different inter-domain orientation. However, a number of complexes with bound inhibitors were found to pose significant problems. Some of these could be traced to faulty definitions of geometrical restraints for ligands and to the general problem of a lack of such information in the PDB depositions. Several problems with ligand definition in the PDB itself were also noted. In several cases extensive corrections to the models were necessary to adhere to the evidence of the electron-density maps. Taken together, this analysis of a large number of structures of a single, medically important protein, all determined within less than a year using modern experimental tools, should be useful in future studies of other systems of high interest to the biomedical community.

Highlights

  • The appearance at the end of 2019 of the new SARS-CoV-2 coronavirus, the causative agent of COVID-19, led to an unprecedented response by the scientific community

  • Structural biologists were among the first to swing into action, with the first structure of a SARS-CoV-2 protein being deposited in the Protein Data Bank (PDB; Berman et al, 2000) within two weeks of the release of the genomic sequence of the virus

  • This first model was followed by 80 PDB depositions of crystal structures of the same enzyme in the free apo form (Table 1) and in complex with a variety of both covalently and noncovalently bound inhibitors (Table 2)

Read more

Summary

Introduction

The appearance at the end of 2019 of the new SARS-CoV-2 coronavirus, the causative agent of COVID-19, led to an unprecedented response by the scientific community. Structural biologists were among the first to swing into action, with the first structure of a SARS-CoV-2 protein being deposited in the Protein Data Bank (PDB; Berman et al, 2000) within two weeks of the release of the genomic sequence of the virus. This was the structure of the main cysteine protease (PDB entry 6lu7; Jin, Zhao et al, 2020), varyingly named 3CLpro or Mpro and sometimes NSP5. — — Zhang et al (2020) Kneller, Phillips, O’Neill, Tan et al (2020) Su et al (2020) Kneller, Phillips, O’Neill, Jedrzejczak et al (2020) Lee et al (2020) Fu et al (2020) — Kneller, Phillips, Weiss et al (2020) — — — — — — Kneller, Phillips, O’Neill, Tan et al (2020) Vuong et al (2020) — — — — — —

The role and significance of 3CLpro
Three-dimensional structure and the active site of 3CLpro
Substrate-binding site
Data mining and assembly of the reference database
Analysis of the structures and assessment of their quality
Conformation of His41 and modeling of the catalytic water molecule
Comparison of all analyzed structures of 3CLpro
Conclusions and outlook
Related literature
Funding information
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call