Fine-Tuning BERT-Based Pre-Trained Models for Arabic Dependency Parsing

Sharefah Al-Ghamdi, Hend Al-Khalifa, Abdulmalik Al-Salman

doi:10.3390/app13074225

Abstract

With the advent of pre-trained language models, many natural language processing tasks in various languages have achieved great success. Although some research has been conducted on fine-tuning BERT-based models for syntactic parsing, and several Arabic pre-trained models have been developed, no attention has been paid to Arabic dependency parsing. In this study, we attempt to fill this gap by comparing nine Arabic models, fine-tuning strategies, and encoding methods for dependency parsing. We evaluated on three treebanks to identify the best options and methods for fine-tuning Arabic BERT-based models to capture syntactic dependencies in the data. Our exploratory results show that the AraBERTv2 model provides the best scores on all treebanks and confirm that fine-tuning that extends to the higher layers of the pre-trained models is required. However, adding further neural network layers on top of those models reduces accuracy. Additionally, we found that the encoding technique that gives the highest scores differs across treebanks. An analysis of the errors on the test examples highlights four issues that have an important effect on the results: parse tree post-processing, contextualized embeddings, erroneous tokenization, and erroneous annotation. This study points to a direction for future research on improving Arabic BERT-based syntactic parsing.


Journal: Applied Sciences	Publication Date: Mar 27, 2023
Citations: 7	License type: CC BY 4.0


