Do We Need a Specific Corpus and Multiple High-Performance GPUs for Training the BERT Model? An Experiment on COVID-19 Dataset

Nontakan Nuntachit,Prompong Sugunnasil

doi:10.3390/make4030030

Nontakan Nuntachit, Prompong Sugunnasil

Open Access

https://doi.org/10.3390/make4030030

Copy DOI

Abstract

The COVID-19 pandemic has impacted daily lives around the globe. Since 2019, the amount of literature focusing on COVID-19 has risen exponentially. However, it is almost impossible for humans to read all of the studies and classify them. This article proposes a method of making an unsupervised model called a zero-shot classification model, based on the pre-trained BERT model. We used the CORD-19 dataset in conjunction with the LitCovid database to construct new vocabulary and prepare the test dataset. For NLI downstream task, we used three corpora: SNLI, MultiNLI, and MedNLI. We significantly reduced the training time by 98.2639% to build a task-specific machine learning model, using only one Nvidia Tesla V100. The final model can run faster and use fewer resources than its comparators. It has an accuracy of 27.84%, which is lower than the best-achieved accuracy by 6.73%, but it is comparable. Finally, we identified that the tokenizer and vocabulary more specific to COVID-19 could not outperform the generalized ones. Additionally, it was found that BART architecture affects the classification results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning and Knowledge Extraction	Publication Date: Jul 4, 2022
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Do We Need a Specific Corpus and Multiple High-Performance GPUs for Training the BERT Model? An Experiment on COVID-19 Dataset

Abstract

Talk to us

Similar Papers

More From: Machine Learning and Knowledge Extraction

Lead the way for us

Similar Papers

COVID-19 Fake News Detection by Using BERT and RoBERTa models
Tashko Pavlov ... Georgina Mirceva
-
Tashko Pavlov, et. al.Tashko Pavlov ... Georgina Mirceva
23 May 2022
23 May 2022

Generative Adversarial Network for Text-to-Face Synthesis and Manipulation with Pretrained BERT Model
Yutong Zhou ... Nobutaka Shimada
-
Yutong Zhou, et. al.Yutong Zhou ... Nobutaka Shimada
15 Dec 2021
15 Dec 2021

Semantic Similarity Comparison of Word Representation Methods in the Field of Health
Hilal Tekgoz ... Halil Ibrahim Celenli
-
Hilal Tekgoz, et. al.Hilal Tekgoz ... Halil Ibrahim Celenli
15 Sep 2021
15 Sep 2021

Sentiment Classification Algorithm Based on the Cascade of BERT Model and Adaptive Sentiment Dictionary
Ruixue Duan ... Xiulei Liu
Wireless Communications and Mobile Computing | VOL. 2021
Ruixue Duan, et. al.Ruixue Duan ... Xiulei Liu
01 Jan 2020
Wireless Communications and Mobile Computing | VOL. 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Do We Need a Specific Corpus and Multiple High-Performance GPUs for Training the BERT Model? An Experiment on COVID-19 Dataset

Abstract

Talk to us

Similar Papers

More From: Machine Learning and Knowledge Extraction