Abstract
Molecular property prediction plays an essential role in drug discovery by identifying candidate molecules with target properties. Deep learning models usually require sufficient labeled data for training; however, labeled data for molecular property prediction are typically scarce, which poses a great challenge to deep learning-based methods. Furthermore, the global information of a molecule is critical for predicting its properties. We therefore propose INTransformer for molecular property prediction, a data augmentation method based on contrastive learning that alleviates the scarcity of labeled molecular data while enhancing the ability to capture global information. Specifically, INTransformer consists of two identical Transformer sub-encoders that extract molecular representations from the original SMILES and a noisy SMILES respectively, thereby achieving data augmentation. To reduce the influence of the noise, contrastive learning is used to keep the molecular encoding of the noisy SMILES consistent with that of the original input, so that INTransformer can better extract molecular representation information. Experiments on various benchmark datasets show that INTransformer achieves competitive performance on molecular property prediction tasks compared with baseline and state-of-the-art methods.
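The two-view training setup described above can be sketched as follows. This is a minimal illustrative sketch only: the abstract does not specify the noising scheme or the exact contrastive objective, so the character-masking perturbation and the InfoNCE-style loss below are assumptions, and the encoder itself is omitted.

```python
import math
import random

def perturb_smiles(smiles, p=0.15, mask_token="*", rng=None):
    # Hypothetical noising scheme: randomly mask characters of the SMILES
    # string to create the second (noisy) view for the sub-encoder.
    rng = rng or random.Random(0)
    return "".join(mask_token if rng.random() < p else c for c in smiles)

def cosine(u, v):
    # Cosine similarity between two embedding vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def contrastive_loss(z_orig, z_noisy, negatives, temperature=0.1):
    # InfoNCE-style consistency loss (an assumption): pull the noisy-view
    # encoding toward the original-view encoding while pushing it away
    # from encodings of other molecules in the batch.
    pos = math.exp(cosine(z_orig, z_noisy) / temperature)
    neg = sum(math.exp(cosine(z_noisy, z) / temperature) for z in negatives)
    return -math.log(pos / (pos + neg))
```

In practice `z_orig` and `z_noisy` would be the outputs of the two Transformer sub-encoders for the clean and perturbed SMILES of the same molecule; the loss is small when the two views agree and large when the noisy view drifts toward other molecules.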