Contrastive learning in protein language space predicts interactions between drugs and protein targets

Rohit Singh,Samuel Sledzieski,Bryan Bryson,Lenore Cowen,Bonnie Berger

doi:10.1073/pnas.2220778120

Abstract

Sequence-based prediction of drug-target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models ("PLex") and employing a protein-anchored contrastive coembedding ("Con") to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (KD = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug-target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the National Academy of Sciences of the United States of America	Publication Date: Jun 8, 2023
Citations: 33	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

Contrastive learning in protein language space predicts interactions between drugs and protein targets

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America

Lead the way for us

Similar Papers

On the Power of Pre-Trained Text Representations
Yu Meng ... Jiaxin Huang
-
Yu Meng, et. al.Yu Meng ... Jiaxin Huang
14 Aug 2021
14 Aug 2021

Multi-Faceted Knowledge-Driven Pre-Training for Product Representation Learning
Denghui Zhang ... Yanjie Fu
IEEE Transactions on Knowledge and Data Engineering | VOL. -
Denghui Zhang, et. al.Denghui Zhang ... Yanjie Fu
01 Jan 2021
IEEE Transactions on Knowledge and Data Engineering | VOL. -

Neural Transfer Learning For Vietnamese Sentiment Analysis Using Pre-trained Contextual Language Models
An Pha Le ... Thanh-Van Le
-
An Pha Le, et. al.An Pha Le ... Thanh-Van Le
16 Dec 2021
16 Dec 2021

A Multi-tasking and Multi-stage Chinese Minority Pre-trained Language Model
Bin Li ... Shutao Li
-
Bin Li, et. al.Bin Li ... Shutao Li
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Contrastive learning in protein language space predicts interactions between drugs and protein targets

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America