Sequence-based prediction of protein binding regions and drug\u2013target interactions

Ingoo Lee,Hojung Nam

doi:10.1186/s13321-022-00584-w

Abstract

Identifying drug–target interactions (DTIs) is important for drug discovery. However, searching all drug–target spaces poses a major bottleneck. Therefore, recently many deep learning models have been proposed to address this problem. However, the developers of these deep learning models have neglected interpretability in model construction, which is closely related to a model’s performance. We hypothesized that training a model to predict important regions on a protein sequence would increase DTI prediction performance and provide a more interpretable model. Consequently, we constructed a deep learning model, named Highlights on Target Sequences (HoTS), which predicts binding regions (BRs) between a protein sequence and a drug ligand, as well as DTIs between them. To train the model, we collected complexes of protein–ligand interactions and protein sequences of binding sites and pretrained the model to predict BRs for a given protein sequence–ligand pair via object detection employing transformers. After pretraining the BR prediction, we trained the model to predict DTIs from a compound token designed to assign attention to BRs. We confirmed that training the BRs prediction model indeed improved the DTI prediction performance. The proposed HoTS model showed good performance in BR prediction on independent test datasets even though it does not use 3D structure information in its prediction. Furthermore, the HoTS model achieved the best performance in DTI prediction on test datasets. Additional analysis confirmed the appropriate attention for BRs and the importance of transformers in BR and DTI prediction. The source code is available on GitHub (https://github.com/GIST-CSBL/HoTS).

Highlights

Identifying drug–target interactions (DTIs) is a crucial step in drug discovery
As stated above, the average precision (AP) dropped significantly at the first DTI training epoch, AP values for additional DTI training epochs converged following the trend of those for the Binding region (BR) prediction epochs
Given the observed convergence in model performance, we interpret that the BR and DTI prediction models shared common features

Summary

Introduction

Identifying drug–target interactions (DTIs) is a crucial step in drug discovery. As it is not feasible to test all chemical compounds against a given target protein, in silico prediction of possible active compounds using massive chemical libraries can increase the efficiency of drug discovery [1]. Thanks to the vast amount of information on drug compounds and their targets [2], as well as advances in computing power, researchers have been able to develop DTI prediction models using the proteochemometric (PCM) approach [3]. As protein feature engineering for DTI prediction, identification of binding pockets/sites is important for prediction performance and comprehensive modeling [13,14,15]. Many computational models have been developed to identify binding pockets/sites.

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of cheminformatics	Publication Date: Feb 8, 2022
Citations: 29	License type: open-access

R Discovery Prime

R Discovery Prime

Sequence-based prediction of protein binding regions and drug\u2013target interactions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of cheminformatics

Lead the way for us

Similar Papers

Deep Learning in Drug Target Interaction Prediction: Current and Future Perspectives.
Karim Abbasi ... Ali Masoudi-Nejad
Current Medicinal Chemistry | VOL. 28
Karim Abbasi, et. al.Karim Abbasi ... Ali Masoudi-Nejad
07 Sep 2020
Current Medicinal Chemistry | VOL. 28

A Comparative Study of Amino Acid Encoding Methods for Predicting Drug-Target Interactions in COVID-19 Disease
Talha Burak Alakus ... Ibrahim Turkoglu
-
Talha Burak Alakus, et. al.Talha Burak Alakus ... Ibrahim Turkoglu
02 Nov 2021
02 Nov 2021

GraphormerDTI: A graph transformer-based approach for drug-target interaction prediction
Mengmeng Gao ... Yi Chen
Computers in Biology and Medicine | VOL. 173
Mengmeng Gao, et. al.Mengmeng Gao ... Yi Chen
18 Mar 2024
Computers in Biology and Medicine | VOL. 173

EDC-DTI: An end-to-end deep collaborative learning model based on multiple information for drug-target interactions prediction
Yongna Yuan ... Lei Liu
Journal of Molecular Graphics and Modelling | VOL. 122
Yongna Yuan, et. al.Yongna Yuan ... Lei Liu
21 Apr 2023
Journal of Molecular Graphics and Modelling | VOL. 122

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sequence-based prediction of protein binding regions and drug\u2013target interactions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of cheminformatics