Mini-batch Size Research Articles

The coronavirus disease-2019 (COVID-19) pandemic, caused by severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2), has urgently necessitated effective therapeutic solutions, with a focus on rapidly identifying and classifying potential small-molecule drugs. Given traditional methods’ labor-intensive and time-consuming nature, deep learning has emerged as an essential tool for efficiently processing and extracting insights from complex biological data. To utilize deep learning techniques, particularly deep neural networks (DNN) enhanced with the synthetic minority oversampling technique (SMOTE), to enhance the classification of binding activities in anti-SARS-CoV-2 molecules across various bioassays. We used 11 bioassay datasets covering various SARS-CoV-2 interactions and inhibitory mechanisms. These assays ranged from spike-ACE2 protein-protein interaction to ACE2 enzymatic activity and 3CL enzymatic activity. To address the prevalent class imbalance in these datasets, the SMOTE technique was employed to generate new samples for the minority class. In our model-building approach, we divided the dataset into 80% training and 20% test sets, reserving 10% of the training set for validation. Our approach involved employing a DNN that integrates ReLU and sigmoid activation functions, incorporates batch normalization, and uses Adam optimization. The hyperparameters and architecture of the DNN were optimized through various tests on layers, minibatch sizes, epoch sizes, and learning rates. A 40% dropout rate was incorporated to mitigate overfitting. For model evaluation, we computed performance metrics, such as balanced accuracy (BACC), precision, recall, F1 score, Matthews’ correlation coefficient (MCC), and area under the curve (AUC). The performance of the DNN across 11 bioassay test sets revealed varying outcomes, significantly influenced by the ratios of active-to-inactive compounds. Assays, such as AlphaLISA and CoV-PPE, demonstrated robust performance across various metrics, including BACC, precision, recall, and AUC, when configured with more balanced ratios (1:3 and 1:1, respectively). This suggests the effective identification of active compounds in both cases. In contrast, assays with higher imbalance ratios, such as 3CL (1:38) and cytopathic effect (1:15), demonstrated higher recall but lower precision, highlighting challenges in accurately identifying active compounds among numerous inactive compounds. However, even in these challenging settings, the model achieved favorable BACC and recall scores. Overall, the DNN model generally performed well, as indicated by the BACC, MCC, and AUC values, especially when considering the degree of dataset imbalance in each assay. This study demonstrates the significant impact of deep learning, particularly DNN models enhanced with SMOTE, in improving the identification of active compounds in bioassay datasets for COVID-19 drug discovery, outperforming traditional machine learning models. Furthermore, this study highlights the efficacy of advanced computational techniques in addressing high-throughput screening data imbalances.

Read full abstract

Abstract Background Early detection of left ventricular systolic dysfunction (LVSD) using minimally invasive electrocardiographic assessment in an ambulatory setting may be important considering the increasing economic burden of HF. Recently artificial intelligence (AI) algorithms have been reported to classify reduced ejection fraction using snapshot 12-lead ECG measurements, however the ability of AI to identify LVSD using only a single lead ambulatory ECG is unknown. Purpose We aimed to develop a convolution neural network (CNN) based deep learning algorithm using single lead ECG to predict ejection fraction less than 40% in ambulatory patients implanted with implantable loop recorders (ILRs). Methods We linked ILR patients with LVEF measurements from a de-identified database of aggregated electronic health record (EHR) data during the period of 2007-2021 to a manufacturer’s device database with ambulatory single lead ECGS. The routine ECG transmissions from the ILR devices were paired with LVEF measured within a week. The data was pre-processed to obtain the time-frequency components of ECG using wavelet transform. The wavelet coefficients were then used as input to train and validate a 2D-CNN model. An independent test dataset consisting of data from patient not included in training/validation dataset was used to assess the performance of CNN model to classify LVEF&lt; = 40% using the metrics area under the curve (AUC), sensitivity and specificity. Results A total of 35,741 unique LVEF-ECG pairs collected from 2,249 ILR patients were used to train the model; an independent validation dataset consisted of 6,721 unique LVEF ECG pairs from 750 patients and independent test dataset consisted of 6,611 unique LVEF ECG pairs from 750 patients. In our CNN model, cross entropy was used as loss function and adaptive moment estimation (ADAM) was used as the optimizer. The initial learning rate was selected as 0.0001, epochs as 20 and mini batch size as 64. The threshold to delineate reduced ejection fraction (LVEF&lt; = 40%) was set as 0.1 (Figure 1). The model yielded accuracy, sensitivity, specificity, AUC of 75%,70%, 76% and 0.8 respectively in the independent test set (Figure 2). Conclusion A deep learning algorithm applied to ambulatory single lead ECGs acquired by ILR can detect LVEF&lt; = 40%. This continuous ambulatory AI-ECG monitoring may allow to identify longitudinal changes in LVEF post ILR implant via remote monitoring.Figure 1:Model output vs Actual EF rangeFigure 2:ROC

Read full abstract

Mini-batch Size Research Articles

Related Topics

Articles published on Mini-batch Size

A comparative study of hot tensile deformation behavior of 6016 aluminum alloy under LSTM neural network and Arrhenius model

Parallel and scalable AI in HPC systems for CFD applications and beyond

Learning Joint Topic Representation for Detecting Drift in Social Media Text

Deep SqueezeNet learning model for diagnosis and prediction of maize leaf diseases

SRPM-ST: Sequential retraining and pseudo-labeling in mini-batches for self-training

DYNAMITE: Dynamic Interplay of Mini-Batch Size and Aggregation Frequency for Federated Learning With Static and Streaming Datasets

Tuning VGG19 hyperparameters for improved pneumonia classification

A stochastic gradient method with variance control and variable learning rate for Deep Learning

Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems

Binding Activity Classification of Anti-SARS-CoV-2 Molecules using Deep Learning Across Multiple Assays

Enhancing Skin Disease Classification: A Novel Approach With Tailored Loss Functions And SMOTE Sumeet Ghumare

Prediction of sloshing pressure using image-based deep learning

Phase transitions in the mini-batch size for sparse and dense two-layer neural networks

An Inexact Sequential Quadratic Programming Method for Learning and Control of Recurrent Neural Networks.

ResNet-50 vs. EfficientNet-B0: Multi-Centric Classification of Various Lung Abnormalities Using Deep Learning

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models.

Inexact tensor methods and their application to stochastic convex optimization

Deep learning model to identify potential occurrence of reduced LVEF in patients with implantable loop recorders

Towards accelerating model parallelism in distributed deep learning systems.

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mini-batch Size Research Articles

Related Topics

Articles published on Mini-batch Size

A comparative study of hot tensile deformation behavior of 6016 aluminum alloy under LSTM neural network and Arrhenius model

Parallel and scalable AI in HPC systems for CFD applications and beyond

Learning Joint Topic Representation for Detecting Drift in Social Media Text

Deep SqueezeNet learning model for diagnosis and prediction of maize leaf diseases

SRPM-ST: Sequential retraining and pseudo-labeling in mini-batches for self-training

DYNAMITE: Dynamic Interplay of Mini-Batch Size and Aggregation Frequency for Federated Learning With Static and Streaming Datasets

Tuning VGG19 hyperparameters for improved pneumonia classification

A stochastic gradient method with variance control and variable learning rate for Deep Learning

Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems

Binding Activity Classification of Anti-SARS-CoV-2 Molecules using Deep Learning Across Multiple Assays

Enhancing Skin Disease Classification: A Novel Approach With Tailored Loss Functions And SMOTE Sumeet Ghumare

Prediction of sloshing pressure using image-based deep learning

Phase transitions in the mini-batch size for sparse and dense two-layer neural networks

An Inexact Sequential Quadratic Programming Method for Learning and Control of Recurrent Neural Networks.

ResNet-50 vs. EfficientNet-B0: Multi-Centric Classification of Various Lung Abnormalities Using Deep Learning

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models.

Inexact tensor methods and their application to stochastic convex optimization

Deep learning model to identify potential occurrence of reduced LVEF in patients with implantable loop recorders

Towards accelerating model parallelism in distributed deep learning systems.

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks