Language models for the prediction of SARS-CoV-2 inhibitors.

Andrew E Blanchard, Feiyi Wang, Debsindhu Bhowmik, John Gounley, Junqi Yin, Mayanka Chandra Shekar, Aristeidis Tsaris, Jens Glaser, Isaac Lyngaas, Shang Gao

doi:10.1177/10943420221121804

Open Access

https://doi.org/10.1177/10943420221121804

Abstract

The COVID-19 pandemic highlights the need for computational tools to automate and accelerate drug design for novel protein targets. We leverage deep learning language models to generate and score drug candidates based on predicted protein binding affinity. We pre-trained a deep learning language model (BERT) on ∼9.6 billion molecules and achieved peak performance of 603 petaflops in mixed precision. Our work reduces pre-training time from days to hours, compared to previous efforts with this architecture, while also increasing the dataset size by nearly an order of magnitude. For scoring, we fine-tuned the language model using an assembled set of thousands of protein targets with binding affinity data and searched for inhibitors of specific protein targets, SARS-CoV-2 Mpro and PLpro. We utilized a genetic algorithm approach for finding optimal candidates using the generation and scoring capabilities of the language model. Our generalizable models accelerate the identification of inhibitors for emerging therapeutic targets.
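The genetic algorithm loop mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_mutate` is a hypothetical stand-in for language-model generation (it resamples one position of a string, loosely imitating a mask-and-fill step), `toy_score` is a hypothetical stand-in for the fine-tuned binding affinity predictor, and the four-letter alphabet is an assumption chosen only to keep the example runnable without any model.

```python
import random
from typing import Callable, List

# Toy token set (assumption); a real run would use a molecular tokenizer vocabulary.
ALPHABET = "CNOF"


def toy_mutate(molecule: str, rng: random.Random) -> str:
    """Hypothetical stand-in for generation: resample one 'masked' position."""
    pos = rng.randrange(len(molecule))
    return molecule[:pos] + rng.choice(ALPHABET) + molecule[pos + 1:]


def toy_score(molecule: str) -> float:
    """Hypothetical stand-in for an affinity predictor: fraction of 'N' tokens."""
    return molecule.count("N") / len(molecule)


def genetic_search(
    population: List[str],
    mutate: Callable[[str, random.Random], str],
    score: Callable[[str], float],
    generations: int,
    rng: random.Random,
) -> List[str]:
    """Elitist search: propose one child per parent, keep the best unique candidates."""
    size = len(population)
    for _ in range(generations):
        children = [mutate(m, rng) for m in population]
        pool = set(population) | set(children)
        # Sort by descending score, breaking ties alphabetically for determinism.
        population = sorted(pool, key=lambda m: (-score(m), m))[:size]
    return population


if __name__ == "__main__":
    start = ["CCCCCCCC", "CCOCCOCC", "CNCCCCCC", "FFCCOOCC"]
    final = genetic_search(start, toy_mutate, toy_score, generations=20, rng=random.Random(0))
    for molecule in final:
        print(molecule, round(toy_score(molecule), 3))
```

Because parents stay in the selection pool, the best score in the population never decreases from one generation to the next. In a setting like the one the abstract describes, the two stand-in functions would be replaced by calls to the pre-trained generator and the fine-tuned scoring model.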

Journal: The International Journal of High Performance Computing Applications
Publication Date: Oct 7, 2022
Citations: 16
License type: NO-CC CODE
