Single-sequence protein structure prediction by integrating protein language models

Xiaoyang Jing,Fandi Wu,Xiao Luo,Jinbo Xu

doi:10.1073/pnas.2308788121

Abstract

Protein structure prediction has been greatly improved by deep learning in the past few years. However, the most successful methods rely on a multiple sequence alignment (MSA) of the sequence homologs of the protein under prediction. In nature, a protein folds in the absence of its sequence homologs, and thus an MSA-free structure prediction method is desirable. Here, we develop RaptorX-Single, a single-sequence-based protein structure prediction method that integrates several protein language models with a structure generation module, and then study its advantages over MSA-based methods. Our experimental results indicate that, in addition to running much faster than MSA-based methods such as AlphaFold2, RaptorX-Single outperforms AlphaFold2 and other MSA-free methods in predicting the structures of antibodies (after fine-tuning on antibody data), the structures of proteins with very few sequence homologs, and single-mutation effects. A comparison of different protein language models shows that not only the scale but also the training data of a protein language model affects performance. RaptorX-Single also compares favorably to MSA-based AlphaFold2 when the protein under prediction has a large number of sequence homologs.

Journal: Proceedings of the National Academy of Sciences
Publication Date: Mar 20, 2024
Citations: 4
License type: CC BY-NC-ND 4.0

Similar Papers

Improving protein structure prediction using templates and sequence embedding.
Fandi Wu ... Xiao Luo
Bioinformatics (Oxford, England) | VOL. 39
10 Nov 2022

Single-sequence-based prediction of protein secondary structures and solvent accessibility by deep whole-sequence learning.
Rhys Heffernan ... Kuldip Paliwal
Journal of Computational Chemistry | VOL. 39
05 Oct 2018

Protein structure prediction and conformational transitions
Haitao Cheng
28 Apr 2012

PSSP-RFE: Accurate Prediction of Protein Structural Class by Recursive Feature Extraction from PSI-BLAST Profile, Physical-Chemical Property and Functional Annotations
Liqi Li ... Sanjiu Yu
PLoS ONE | VOL. 9
27 Mar 2014
