Rethinking of BERT sentence embedding for text classification

Omar Galal,Ahmed H Abdel-Gawad,Mona Farouk

doi:10.1007/s00521-024-10212-3

Abstract

Text classification is a fundamental task in NLP that is used in several real-life tasks and applications. Large pre-trained language models such as BERT achieve state-of-the-art performance in several NLP tasks including text classification tasks. Although BERT boosts text classification performance, the common way of using it for classification lacks many aspects of its advantages. This work rethinks the way of using BERT final layer and hidden layers embeddings by proposing different aggregation architectures for text classification tasks such as sentiment analysis and sarcasm detection. This research also proposes different approaches for using BERT as a feature extractor without fine-tuning whose performance surpasses its fine-tuning counterpart. It also proposes promising multi-task learning aggregation architectures to improve the performance of the related classification problems. The experiments of the different architectures show that freezing BERT can outperform fine-tuning it for sentiment analysis. The experiments also show that multi-task learning while freezing BERT boosts the performance of yet hard tasks such as sarcasm detection. The best-performing models achieved new state-of-the-art performance on the ArSarcasm-v2 dataset for Arabic sarcasm detection and sentiment analysis. For multi-task learning and freezing BERT, a new SOTA F1-score of 64.41 was achieved for the sarcasm detection with a 3.47% improvement and near SOTA FPN of 75.78 for the sentiment classification. For single-task learning, a new SOTA FPN of 75.26 was achieved for the sentiment with a 1.81% improvement.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rethinking of BERT sentence embedding for text classification

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications

Lead the way for us

Journal: Neural Computing and Applications	Publication Date: Aug 12, 2024
License type: CC BY 4.0

Similar Papers

Multi-level embeddings for processing Arabic social media contents
Leila Moudjari ... Karima Akli-Astouati
Computer Speech & Language | VOL. 70
Leila Moudjari, et. al.Leila Moudjari ... Karima Akli-Astouati
05 May 2021
Computer Speech & Language | VOL. 70

AraXLNet: pre-trained language model for sentiment analysis of Arabic
Alhanouf Alduailej ... Abdulrahman Alothaim
Journal of Big Data | VOL. 9
Alhanouf Alduailej, et. al.Alhanouf Alduailej ... Abdulrahman Alothaim
31 May 2022
Journal of Big Data | VOL. 9

LASTD: A Manually Annotated and Tested Large Arabic Sentiment Tweets Dataset
Kariman Elshakankery ... Mona Farouk
-
Kariman Elshakankery, et. al.Kariman Elshakankery ... Mona Farouk
27 May 2021
27 May 2021

Evaluating Multilingual BERT for Estonian
Claudia Kittask ... Kirill Milintsevich
-
Claudia Kittask, et. al.Claudia Kittask ... Kirill Milintsevich
15 Sep 2020
15 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rethinking of BERT sentence embedding for text classification

Abstract

Talk to us

Similar Papers

More From: Neural Computing and Applications