Uncovering new families and folds in the natural protein universe

Janani Durairaj,Andrew M Waterhouse,Toomas Mets,Tetiana Brodiazhenko,Minhal Abdullah,Gabriel Studer,Gerardo Tauriello,Mehmet Akdel,Antonina Andreeva,Alex Bateman,Tanel Tenson,Vasili Hauryliuk,Torsten Schwede,Joana Pereira

doi:10.1038/s41586-023-06622-3

Janani Durairaj, Andrew M Waterhouse + Show 12 more

Open Access

https://doi.org/10.1038/s41586-023-06622-3

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

We are now entering a new era in protein sequence and structure annotation, with hundreds of millions of predicted protein structures made available through the AlphaFold database1. These models cover nearly all proteins that are known, including those challenging to annotate for function or putative biological role using standard homology-based approaches. In this study, we examine the extent to which the AlphaFold database has structurally illuminated this ‘dark matter’ of the natural protein universe at high predicted accuracy. We further describe the protein diversity that these models cover as an annotated interactive sequence similarity network, accessible at https://uniprot3d.org/atlas/AFDB90v4. By searching for novelties from sequence, structure and semantic perspectives, we uncovered the β-flower fold, added several protein families to Pfam database2 and experimentally demonstrated that one of these belongs to a new superfamily of translation-targeting toxin–antitoxin systems, TumE–TumA. This work underscores the value of large-scale efforts in identifying, annotating and prioritizing new protein families. By leveraging the recent deep learning revolution in protein bioinformatics, we can now shed light into uncharted areas of the protein universe at an unprecedented scale, paving the way to innovations in life sciences and biotechnology.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature	Publication Date: Sep 13, 2023
Citations: 57	License type: CC BY 4.0

R Discovery Prime

Uncovering new families and folds in the natural protein universe

Abstract

Published Version

Talk to us

Similar Papers

More From: Nature

Lead the way for us

Similar Papers

GAP Final Technical Report 12-14-04

-

14 Dec 2004
GAP Final Technical Report 12-14-04

A Multimodal Protein Representation Framework for Quantifying Transferability Across Biochemical Downstream Tasks.
Fan Hu ... Yi Pan
Advanced Science | VOL. 10
Fan Hu, et. al.Fan Hu ... Yi Pan
30 May 2023
Advanced Science | VOL. 10

Evolutionary trace report_maker: a new type of service for comparative analysis of proteins
I Mihalek ... O Lichtarge
Bioinformatics | VOL. 22
I Mihalek, et. al.I Mihalek ... O Lichtarge
27 Apr 2006
Bioinformatics | VOL. 22

A Proteogenomic Survey of the Medicago truncatula Genome
Jeremy D Volkening ... Michael R Sussman
Molecular & Cellular Proteomics | VOL. 11
Jeremy D Volkening, et. al.Jeremy D Volkening ... Michael R Sussman
01 Oct 2012
Molecular & Cellular Proteomics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Uncovering new families and folds in the natural protein universe

Abstract

Published Version

Talk to us

Similar Papers

More From: Nature