Abstract
The fact that proteins can have their chain formed in a knot is known for almost 30 years. However, as they are not common, only a fraction of such proteins is available in the Protein Data Bank. It was not possible to assess their importance and versatility up until now because we did not have access to the whole proteome of an organism, let alone a human one. The arrival of efficient machine learning methods for protein structure prediction, such as AlphaFold and RoseTTaFold, changed that. We analyzed all proteins from the human proteome (over 20,000) determined with AlphaFold in search for knots and found them in less than 2% of the structures. Using a variety of methods, including homolog search, clustering, quality assessment and visual inspection, we determined the nature of each of the knotted structures and classified it as either knotted, potentially knotted, or an artifact, and deposited all of them in a database available at: https://knotprot.cent.uw.edu.pl/alphafold. Overall, we found 51 credible knotted proteins (0.2% of human proteome). The set of potentially knotted structures includes a new complex type of a knot not reported in proteins yet. That knot type, denoted 63 in mathematical notation, would necessitate a more complex folding path than any knotted protein characterized to date. This article is protected by copyright. All rights reserved.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.