Training Epochs Research Articles

Question Answering (QA) is a prominent task in the field of Natural Language Processing (NLP) with extensive applications. Recently, there has been a notable surge in research interest concerning the development of QA systems for the Holy Qur’an, an Islamic religious text. The Qur’an Reading Comprehension Dataset (QRCD) Malhas and Elsayed (2020) is a highly commendable effort in this respect. It stands as the first benchmark dataset specifically designed for a set of directly answerable questions from the Qur’an. Each question in the dataset is meticulously labeled with all potential answers sourced from the Holy Qur’an. From our perspective, the main challenge in QRCD stems from the limited volume of training data it offers. As a solution we propose an innovative approach to build a Deep Neural Network (DNN) ensemble, centered around Ara-Electra model (Antoun et al., 2021), that we called Weight Averaging and Re-adjustment (WAR) model. The model is constructed by computing running averages of all model states that evolve during a single training session and ensuring that model weights are readjusted prior to each training epoch, in order to hold it back from over fitting the training data. The scheme results in a single standalone model that exhibits the benefits of multi-model ensembles. It is distinguished from other ensembles proposed for QRCD that accumulates outputs from multiple expert models and employs classic techniques like hard voting or score averaging on output probabilities to build unified results. Each expert model costs individual training time and compute resources. The WAR model outperforms existing systems with improved generalization over unseen data. It achieves F1, partial Reciprocal Rank (pRR), and exact-match (EM) scores of 0.567, 0.60 and 0.29 respectively, exceeding best reported QRCD scores by 3%, 1.5% and 0.69% respectively. Notably, we are comparing our results with the top scores from different models, highlighting our model’s consistent performance across all three metrics.

To efficiently predict the mechanical parameters of granular soil based on its random micro-structure, this study proposed a novel approach combining numerical simulation and machine learning algorithms. Initially, 3500 simulations of one-dimensional compression tests on coarse-grained sand using the three-dimensional (3D) discrete element method (DEM) were conducted to construct a database. In this process, the positions of the particles were randomly altered, and the particle assemblages changed. Interestingly, besides confirming the influence of particle size distribution parameters, the stress-strain curves differed despite an identical gradation size statistic when the particle position varied. Subsequently, the obtained data were partitioned into training, validation, and testing datasets at a 7:2:1 ratio. To convert the DEM model into a multi-dimensional matrix that computers can recognize, the 3D DEM models were first sliced to extract multi-layer two-dimensional (2D) cross-sectional data. Redundant information was then eliminated via gray processing, and the data were stacked to form a new 3D matrix representing the granular soil's fabric. Subsequently, utilizing the Python language and Pytorch framework, a 3D convolutional neural networks (CNNs) model was developed to establish the relationship between the constrained modulus obtained from DEM simulations and the soil's fabric. The mean squared error (MSE) function was utilized to assess the loss value during the training process. When the learning rate (LR) fell within the range of 10-5–10-1, and the batch sizes (BSs) were 4, 8, 16, 32, and 64, the loss value stabilized after 100 training epochs in the training and validation dataset. For BS = 32 and LR = 10-3, the loss reached a minimum. In the testing set, a comparative evaluation of the predicted constrained modulus from the 3D CNNs versus the simulated modulus obtained via DEM reveals a minimum mean absolute percentage error (MAPE) of 4.43% under the optimized condition, demonstrating the accuracy of this approach. Thus, by combining DEM and CNNs, the variation of soil's mechanical characteristics related to its random fabric would be efficiently evaluated by directly tracking the particle assemblages.

Training Epochs Research Articles

Related Topics

Articles published on Training Epochs

Life regression based patch slimming for vision transformers

BiLSTM-CNN with fixed weight approach for tracking speech articulatory features

Evaluating the stealth of reinforcement learning-based cyber attacks against unknown scenarios using knowledge transfer techniques

BERT-based language model for accurate drug adverse event extraction from social media: implementation, evaluation, and contributions to pharmacovigilance practices.

A bilateral filtering-based image enhancement for Alzheimer disease classification using CNN.

EH-former: Regional easy-hard-aware transformer for breast lesion segmentation in ultrasound images

On-device edge-learning for cardiac abnormality detection using a bio-inspired and spiking shallow network

Bandit-NAS: Bandit sampling and training method for Neural Architecture Search

Flexible and Energy-Efficient Synaptic Transistor with Quasi-Linear Weight Update Protocol by Inkjet Printing of Orientated Polar-Electret/High-k Oxide Composite Dielectric.

Principal Component Networks: Utilizing Low-Rank Activation Structure to Reduce Parameters Early in Training

The principle of minimum pressure gradient: An alternative basis for physics-informed learning of incompressible fluid mechanics

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising.

Weight Averaging and re-adjustment ensemble for QRCD

Prediction of constrained modulus for granular soil using 3D discrete element method and convolutional neural networks

DEW: A wavelet approach of rare sound event detection.

DenseNet Architecture for Efficient and Accurate Recognition of Javanese Script Hanacaraka Character

Control-Etched Ti3C2Tx MXene Nanosheets for a Low-Voltage-Operating Flexible Memristor for Efficient Neuromorphic Computation.

Deep learning for predictive window operation modeling in open-plan offices

United We Stand: Using Epoch-Wise Agreement of Ensembles to Combat Overfit

Quality Metrics of Automated Machinery in Potato Plant Cultivation for Breeding and Seed Production

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Training Epochs Research Articles

Related Topics

Articles published on Training Epochs

Life regression based patch slimming for vision transformers

BiLSTM-CNN with fixed weight approach for tracking speech articulatory features

Evaluating the stealth of reinforcement learning-based cyber attacks against unknown scenarios using knowledge transfer techniques

BERT-based language model for accurate drug adverse event extraction from social media: implementation, evaluation, and contributions to pharmacovigilance practices.

A bilateral filtering-based image enhancement for Alzheimer disease classification using CNN.

EH-former: Regional easy-hard-aware transformer for breast lesion segmentation in ultrasound images

On-device edge-learning for cardiac abnormality detection using a bio-inspired and spiking shallow network

Bandit-NAS: Bandit sampling and training method for Neural Architecture Search

Flexible and Energy-Efficient Synaptic Transistor with Quasi-Linear Weight Update Protocol by Inkjet Printing of Orientated Polar-Electret/High-k Oxide Composite Dielectric.

Principal Component Networks: Utilizing Low-Rank Activation Structure to Reduce Parameters Early in Training

The principle of minimum pressure gradient: An alternative basis for physics-informed learning of incompressible fluid mechanics

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising.

Weight Averaging and re-adjustment ensemble for QRCD

Prediction of constrained modulus for granular soil using 3D discrete element method and convolutional neural networks

DEW: A wavelet approach of rare sound event detection.

DenseNet Architecture for Efficient and Accurate Recognition of Javanese Script Hanacaraka Character

Control-Etched Ti3C2Tx MXene Nanosheets for a Low-Voltage-Operating Flexible Memristor for Efficient Neuromorphic Computation.

Deep learning for predictive window operation modeling in open-plan offices

United We Stand: Using Epoch-Wise Agreement of Ensembles to Combat Overfit

Quality Metrics of Automated Machinery in Potato Plant Cultivation for Breeding and Seed Production