Binding Activity Classification of Anti-SARS-CoV-2 Molecules using Deep Learning Across Multiple Assays

Bilge Eren Yamasan,Selçuk Korkmaz

doi:10.4274/balkanmedj.galenos.2024.2024-1-73

Bilge Eren Yamasan, Selçuk Korkmaz

Open Access

https://doi.org/10.4274/balkanmedj.galenos.2024.2024-1-73

Copy DOI

Journal: Balkan medical journal	Publication Date: May 3, 2024
License type: cc-by-nc-nd

Abstract

The coronavirus disease-2019 (COVID-19) pandemic, caused by severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2), has urgently necessitated effective therapeutic solutions, with a focus on rapidly identifying and classifying potential small-molecule drugs. Given traditional methods’ labor-intensive and time-consuming nature, deep learning has emerged as an essential tool for efficiently processing and extracting insights from complex biological data. To utilize deep learning techniques, particularly deep neural networks (DNN) enhanced with the synthetic minority oversampling technique (SMOTE), to enhance the classification of binding activities in anti-SARS-CoV-2 molecules across various bioassays. We used 11 bioassay datasets covering various SARS-CoV-2 interactions and inhibitory mechanisms. These assays ranged from spike-ACE2 protein-protein interaction to ACE2 enzymatic activity and 3CL enzymatic activity. To address the prevalent class imbalance in these datasets, the SMOTE technique was employed to generate new samples for the minority class. In our model-building approach, we divided the dataset into 80% training and 20% test sets, reserving 10% of the training set for validation. Our approach involved employing a DNN that integrates ReLU and sigmoid activation functions, incorporates batch normalization, and uses Adam optimization. The hyperparameters and architecture of the DNN were optimized through various tests on layers, minibatch sizes, epoch sizes, and learning rates. A 40% dropout rate was incorporated to mitigate overfitting. For model evaluation, we computed performance metrics, such as balanced accuracy (BACC), precision, recall, F1 score, Matthews’ correlation coefficient (MCC), and area under the curve (AUC). The performance of the DNN across 11 bioassay test sets revealed varying outcomes, significantly influenced by the ratios of active-to-inactive compounds. Assays, such as AlphaLISA and CoV-PPE, demonstrated robust performance across various metrics, including BACC, precision, recall, and AUC, when configured with more balanced ratios (1:3 and 1:1, respectively). This suggests the effective identification of active compounds in both cases. In contrast, assays with higher imbalance ratios, such as 3CL (1:38) and cytopathic effect (1:15), demonstrated higher recall but lower precision, highlighting challenges in accurately identifying active compounds among numerous inactive compounds. However, even in these challenging settings, the model achieved favorable BACC and recall scores. Overall, the DNN model generally performed well, as indicated by the BACC, MCC, and AUC values, especially when considering the degree of dataset imbalance in each assay. This study demonstrates the significant impact of deep learning, particularly DNN models enhanced with SMOTE, in improving the identification of active compounds in bioassay datasets for COVID-19 drug discovery, outperforming traditional machine learning models. Furthermore, this study highlights the efficacy of advanced computational techniques in addressing high-throughput screening data imbalances.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Binding Activity Classification of Anti-SARS-CoV-2 Molecules using Deep Learning Across Multiple Assays

Abstract

Talk to us

Similar Papers

More From: Balkan medical journal

Lead the way for us

Similar Papers

Improved breast lesion detection in mammogram images using a deep neural network.
Wen Zhou ... Xiaoying Wang
Diagnostic and Interventional Radiology | VOL. 29
Wen Zhou, et. al.Wen Zhou ... Xiaoying Wang
01 Jul 2023
Diagnostic and Interventional Radiology | VOL. 29

Computer-aided diagnosis of ground glass pulmonary nodule by fusing deep learning and radiomics features
Xianfang Hu ... Haiming Li
Physics in Medicine & Biology | VOL. 66
Xianfang Hu, et. al.Xianfang Hu ... Haiming Li
08 Mar 2021
Physics in Medicine & Biology | VOL. 66

Decision letter: SARS-CoV-2 shedding dynamics across the respiratory tract, sex, and disease severity for adult and pediatric COVID-19
Lucie Vermeulen
-
Lucie VermeulenLucie Vermeulen
03 Aug 2021
03 Aug 2021

SARS-CoV-2 mRNA vaccine induces robust specific and cross-reactive IgG and unequal neutralizing antibodies in naive and previously infected people
Tara M Narowski ... Lakshmanane Premkumar
Cell Reports | VOL. 38
Tara M Narowski, et. al.Tara M Narowski ... Lakshmanane Premkumar
20 Jan 2022
Cell Reports | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Binding Activity Classification of Anti-SARS-CoV-2 Molecules using Deep Learning Across Multiple Assays

Abstract

Talk to us

Similar Papers

More From: Balkan medical journal