Complementary multi-modality molecular self-supervised learning via non-overlapping masking for property prediction.

Ao Shen,Mingzhi Yuan,Yingfan Ma,Jie Du,Manning Wang

doi:10.1093/bib/bbae256

Abstract

Self-supervised learning plays an important role in molecular representation learning because labeled molecular data are usually limited in many tasks, such as chemical property prediction and virtual screening. However, most existing molecular pre-training methods focus on one modality of molecular data, and the complementary information of two important modalities, SMILES and graph, is not fully explored. In this study, we propose an effective multi-modality self-supervised learning framework for molecular SMILES and graph. Specifically, SMILES data and graph data are first tokenized so that they can be processed by a unified Transformer-based backbone network, which is trained by a masked reconstruction strategy. In addition, we introduce a specialized non-overlapping masking strategy to encourage fine-grained interaction between these two modalities. Experimental results show that our framework achieves state-of-the-art performance in a series of molecular property prediction tasks, and a detailed ablation study demonstrates efficacy of the multi-modality framework and the masking strategy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Complementary multi-modality molecular self-supervised learning via non-overlapping masking for property prediction.

Abstract

Talk to us

Similar Papers

More From: Briefings in bioinformatics

Lead the way for us

Journal: Briefings in bioinformatics	Publication Date: May 23, 2024
License type: cc-by

Similar Papers

Self-Supervised Learning With Segmental Masking for Speech Representation
Xianghu Yue ... Fabian Ritter Gutierrez
IEEE Journal of Selected Topics in Signal Processing | VOL. 16
Xianghu Yue, et. al.Xianghu Yue ... Fabian Ritter Gutierrez
01 Oct 2022
IEEE Journal of Selected Topics in Signal Processing | VOL. 16

HoopTransformer: Advancing NBA Offensive Play Recognition with Self-Supervised Learning from Player Trajectories.
Xing Wang ... Shaoliang Zhang
Sports medicine (Auckland, N.Z.) | VOL. 54
Xing Wang, et. al.Xing Wang ... Shaoliang Zhang
01 Oct 2024
Sports medicine (Auckland, N.Z.) | VOL. 54

Self-Supervised Multimodal Learning: A Survey.
Yongshuo Zong ... Timothy Hospedales
IEEE transactions on pattern analysis and machine intelligence | VOL. PP
Yongshuo Zong, et. al.Yongshuo Zong ... Timothy Hospedales
01 Jan 2024
IEEE transactions on pattern analysis and machine intelligence | VOL. PP

MPS-AMS: Masked Patches Selection and Adaptive Masking Strategy Based Self-Supervised Medical Image Segmentation
Xiangtao Wang ... Thomas Lukasiewicz
-
Xiangtao Wang, et. al.Xiangtao Wang ... Thomas Lukasiewicz
04 Jun 2023
04 Jun 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Complementary multi-modality molecular self-supervised learning via non-overlapping masking for property prediction.

Abstract

Talk to us

Similar Papers

More From: Briefings in bioinformatics