Breaking the barriers of data scarcity in drug-target affinity prediction.

Qizhi Pei,Tie-Yan Liu,Yingce Xia,Haiguang Liu,Lijun Wu,Jinhua Zhu,Shufang Xie,Rui Yan,Tao Qin

doi:10.1093/bib/bbad386

Abstract

Accurate prediction of drug-target affinity (DTA) is of vital importance in early-stage drug discovery, facilitating the identification of drugs that can effectively interact with specific targets and regulate their activities. While wet experiments remain the most reliable method, they are time-consuming and resource-intensive, resulting in limited data availability that poses challenges for deep learning approaches. Existing methods have primarily focused on developing techniques based on the available DTA data, without adequately addressing the data scarcity issue. To overcome this challenge, we present the Semi-Supervised Multi-task training (SSM) framework for DTA prediction, which incorporates three simple yet highly effective strategies: (1) A multi-task training approach that combines DTA prediction with masked language modeling using paired drug-target data. (2) A semi-supervised training method that leverages large-scale unpaired molecules and proteins to enhance drug and target representations. This approach differs from previous methods that only employed molecules or proteins in pre-training. (3) The integration of a lightweight cross-attention module to improve the interaction between drugs and targets, further enhancing prediction accuracy. Through extensive experiments on benchmark datasets such as BindingDB, DAVIS and KIBA, we demonstrate the superior performance of our framework. Additionally, we conduct case studies on specific drug-target binding activities, virtual screening experiments, drug feature visualizations and real-world applications, all of which showcase the significant potential of our work. In conclusion, our proposed SSM-DTA framework addresses the data limitation challenge in DTA prediction and yields promising results, paving the way for more efficient and accurate drug discovery processes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Breaking the barriers of data scarcity in drug-target affinity prediction.

Abstract

Talk to us

Similar Papers

More From: Briefings in Bioinformatics

Lead the way for us

Journal: Briefings in Bioinformatics	Publication Date: Sep 22, 2023
Citations: 7

Similar Papers

Semi-supervised training strategies for deep neural networks
Matthew Gibson ... Puming Zhan
-
Matthew Gibson, et. al.Matthew Gibson ... Puming Zhan
01 Dec 2017
01 Dec 2017

GEFA: Early Fusion Approach in Drug-Target Affinity Prediction.
Tri Minh Nguyen ... Thao Minh Le
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 19
Tri Minh Nguyen, et. al.Tri Minh Nguyen ... Thao Minh Le
01 Mar 2022
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 19

Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction
Masahiro Kaneko ... Masato Mita
-
Masahiro Kaneko, et. al.Masahiro Kaneko ... Masato Mita
01 Jan 2020
01 Jan 2020

Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction
Masahiro Kaneko
Journal of Natural Language Processing | VOL. 27
Masahiro KanekoMasahiro Kaneko
15 Sep 2020
Journal of Natural Language Processing | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Breaking the barriers of data scarcity in drug-target affinity prediction.

Abstract

Talk to us

Similar Papers

More From: Briefings in Bioinformatics