Weight Averaging and re-adjustment ensemble for QRCD

Esha Aftab,Muhammad Kamran Malik

doi:10.1016/j.jksuci.2024.102037

Abstract

Question Answering (QA) is a prominent task in the field of Natural Language Processing (NLP) with extensive applications. Recently, there has been a notable surge in research interest concerning the development of QA systems for the Holy Qur’an, an Islamic religious text. The Qur’an Reading Comprehension Dataset (QRCD) Malhas and Elsayed (2020) is a highly commendable effort in this respect. It stands as the first benchmark dataset specifically designed for a set of directly answerable questions from the Qur’an. Each question in the dataset is meticulously labeled with all potential answers sourced from the Holy Qur’an. From our perspective, the main challenge in QRCD stems from the limited volume of training data it offers. As a solution we propose an innovative approach to build a Deep Neural Network (DNN) ensemble, centered around Ara-Electra model (Antoun et al., 2021), that we called Weight Averaging and Re-adjustment (WAR) model. The model is constructed by computing running averages of all model states that evolve during a single training session and ensuring that model weights are readjusted prior to each training epoch, in order to hold it back from over fitting the training data. The scheme results in a single standalone model that exhibits the benefits of multi-model ensembles. It is distinguished from other ensembles proposed for QRCD that accumulates outputs from multiple expert models and employs classic techniques like hard voting or score averaging on output probabilities to build unified results. Each expert model costs individual training time and compute resources. The WAR model outperforms existing systems with improved generalization over unseen data. It achieves F1, partial Reciprocal Rank (pRR), and exact-match (EM) scores of 0.567, 0.60 and 0.29 respectively, exceeding best reported QRCD scores by 3%, 1.5% and 0.69% respectively. Notably, we are comparing our results with the top scores from different models, highlighting our model’s consistent performance across all three metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Weight Averaging and re-adjustment ensemble for QRCD

Abstract

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences

Lead the way for us

Similar Papers

FTBME: feature transferring based multi-model ensemble
A Yongquan Yang ... E Zhongxi Zheng
Multimedia Tools and Applications | VOL. 79
A Yongquan Yang, et. al.A Yongquan Yang ... E Zhongxi Zheng
12 Mar 2020
Multimedia Tools and Applications | VOL. 79

THQuAD: Turkish Historic Question Answering Dataset for Reading Comprehension
Fatih Soygazi ... Okan Ciftci
-
Fatih Soygazi, et. al.Fatih Soygazi ... Okan Ciftci
15 Sep 2021
15 Sep 2021

Fine-Grained Quran Dataset
Mohamed Osman ... Mohammad Alhawarat
International Journal of Advanced Computer Science and Applications | VOL. 6
Mohamed Osman, et. al.Mohamed Osman ... Mohammad Alhawarat
01 Jan 2015
International Journal of Advanced Computer Science and Applications | VOL. 6

Deep neural network ensemble for reducing artificial noise in bandwidth extension
Kyoungjin Noh ... Joon-Hyuk Chang
Digital Signal Processing | VOL. 102
Kyoungjin Noh, et. al.Kyoungjin Noh ... Joon-Hyuk Chang
05 May 2020
Digital Signal Processing | VOL. 102

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Weight Averaging and re-adjustment ensemble for QRCD

Abstract

Talk to us

Similar Papers

More From: Journal of King Saud University - Computer and Information Sciences