Abstract

Proteins play an essential role in many biological and engineering processes. Large protein language models (PLMs) have excellent potential to reshape protein research by accelerating the determination of protein functions and the design of proteins with desired functions. The prediction and design capacity of PLMs relies on the representations learned from protein sequences. However, most PLMs lack crucial 3D structure information, which restricts their predictive capacity in applications that depend heavily on 3D structure. To address this issue, S-PLM is introduced as a 3D structure-aware PLM that uses multi-view contrastive learning to align the sequence and 3D structure of a protein in a coordinated latent space. S-PLM applies a Swin-Transformer to AlphaFold-predicted protein structures to embed the structural information and fuses it with the sequence-based embeddings from ESM2. Additionally, a library of lightweight tuning tools is provided to adapt S-PLM to diverse downstream protein prediction tasks. The results demonstrate that S-PLM outperforms sequence-only PLMs on all protein clustering and classification tasks, achieving performance competitive with state-of-the-art methods that require both sequence and structure inputs. S-PLM and its lightweight tuning tools are available at https://github.com/duolinwang/S-PLM/.
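To make the alignment idea concrete, the sketch below shows a CLIP-style symmetric contrastive loss of the kind commonly used to pull a protein's sequence and structure embeddings together in a shared latent space while pushing mismatched pairs apart. This is a minimal, hypothetical illustration: the function name, tensor shapes, and temperature value are assumptions for exposition, not the actual S-PLM implementation.

```python
import torch
import torch.nn.functional as F

def contrastive_alignment_loss(seq_emb: torch.Tensor,
                               struct_emb: torch.Tensor,
                               temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE loss over matched sequence/structure pairs.

    seq_emb:    (batch, dim) per-protein embeddings from a sequence encoder
                (e.g., pooled ESM2 outputs passed through a projection head).
    struct_emb: (batch, dim) per-protein embeddings from a structure encoder
                (e.g., a Swin-Transformer over predicted-structure features).
    """
    # L2-normalize so dot products become cosine similarities.
    seq = F.normalize(seq_emb, dim=-1)
    struct = F.normalize(struct_emb, dim=-1)

    # Pairwise similarity matrix; diagonal entries are the matched pairs.
    logits = seq @ struct.t() / temperature
    targets = torch.arange(logits.size(0), device=logits.device)

    # Cross-entropy in both directions: sequence-to-structure and back.
    loss_s2t = F.cross_entropy(logits, targets)
    loss_t2s = F.cross_entropy(logits.t(), targets)
    return 0.5 * (loss_s2t + loss_t2s)

if __name__ == "__main__":
    # Toy check with random embeddings for a batch of 8 proteins.
    seq = torch.randn(8, 128)
    struct = torch.randn(8, 128)
    print(contrastive_alignment_loss(seq, struct).item())
```

In a real setup, each encoder's output would typically pass through a projection head so that both views share a common embedding dimension before the loss is computed.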
