Comparative Analysis of State-of-the-Art Q A Models: BERT, RoBERTa, DistilBERT, and ALBERT on SQuAD v2 Dataset

Cem Özkurt

doi:10.69882/adba.chf.2024073

Abstract

In the rapidly evolving landscape of natural language processing (NLP) and artificial intelligence, recent years have witnessed significant advancements, particularly in text-based question-answering (QA) systems. The Stanford Question Answering Dataset (SQuAD v2) has emerged as a prominent benchmark, offering diverse language understanding challenges. This study conducts a thorough examination of cutting-edge QA models—BERT, DistilBERT, RoBERTa, and ALBERT—each featuring distinct architectures, focusing on their training and performance on SQuAD v2. The analysis aims to uncover the unique strengths of each model, providing insights into their capabilities and exploring the impact of different training techniques on their performance. The primary objective is to enhance our understanding of text-based QA systems' evolution and their effectiveness in real-world scenarios. The results of this comparative study are poised to influence the utilization and development of these models in both industry and research. The investigation meticulously evaluates BERT, ALBERT, RoBERTa, and DistilBERT QA models using the SQuAD v2 dataset, emphasizing instances of accurate responses and identifying areas where completeness may be lacking. This nuanced exploration contributes to the ongoing discourse on the advancement of text-based question-answering systems, shedding light on the strengths and limitations of each QA model. Based on the results obtained, ALBERT achieved an exact match of 86.85% and an F1 score of 89.91% on the SQuAD v2 dataset, demonstrating superior performance in both answerable ('HasAns') and unanswerable ('NoAns') questions. BERT and RoBERTa also showed strong performance, while DistilBERT lagged slightly behind. This study provides a significant contribution to the advancement of text-based question-answering systems, offering insights that can shape the utilization of these models in both industry and research domains.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparative Analysis of State-of-the-Art Q A Models: BERT, RoBERTa, DistilBERT, and ALBERT on SQuAD v2 Dataset

Abstract

Talk to us

Similar Papers

More From: Chaos and Fractals

Lead the way for us

Journal: Chaos and Fractals	Publication Date: Jul 31, 2024
License type: cc-by-nc

Similar Papers

Eliciting Bias in Question Answering Models through Ambiguity
Andrew Mao ... Jordan Boyd-Graber
-
Andrew Mao, et. al.Andrew Mao ... Jordan Boyd-Graber
01 Jan 2020
01 Jan 2020

A Survey of Text Question Answering Techniques
Poonam Gupta ... Vishal Gupta
International Journal of Computer Applications | VOL. 53
Poonam Gupta, et. al.Poonam Gupta ... Vishal Gupta
25 Sep 2012
International Journal of Computer Applications | VOL. 53

Unanswerable Question Correction in Question Answering over Personal Knowledge Base
An-Zi Yen ... Hsin-Hsi Chen
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
An-Zi Yen, et. al.An-Zi Yen ... Hsin-Hsi Chen
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

Computer Security Based Question Answering System with IR and Google BERT
Pragya Agrawal ... H R Mamatha
-
Pragya Agrawal, et. al.Pragya Agrawal ... H R Mamatha
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparative Analysis of State-of-the-Art Q A Models: BERT, RoBERTa, DistilBERT, and ALBERT on SQuAD v2 Dataset

Abstract

Talk to us

Similar Papers

More From: Chaos and Fractals