A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition

Mashael Al-Duwais,Abdulmalik Al-Salman,Hend Al-Khalifa

doi:10.3390/electronics13173574

Abstract

Multilingual large language models (MLLMs) have demonstrated remarkable performance across a wide range of cross-lingual Natural Language Processing (NLP) tasks. The emergence of MLLMs made it possible to achieve knowledge transfer from high-resource to low-resource languages. Several MLLMs have been released for cross-lingual transfer tasks. However, no systematic evaluation comparing all models for Arabic cross-lingual Named-Entity Recognition (NER) is available. This paper presents a benchmark evaluation to empirically investigate the performance of the state-of-the-art multilingual large language models for Arabic cross-lingual NER. Furthermore, we investigated the performance of different MLLMs adaptation methods to better model the Arabic language. An error analysis of the different adaptation methods is presented. Our experimental results indicate that GigaBERT outperforms other models for Arabic cross-lingual NER, while language-adaptive pre-training (LAPT) proves to be the most effective adaptation method across all datasets. Our findings highlight the importance of incorporating language-specific knowledge to enhance the performance in distant language pairs like English and Arabic.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition

Abstract

Published Version

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Journal: Electronics	Publication Date: Sep 9, 2024
License type: CC BY 4.0

Similar Papers

DeIDNER Corpus: Annotation of Clinical Discharge Summary Notes for Named Entity Recognition Using BRAT Tool.
Mahanazuddin Syed ... Melody L Greer
Studies in health technology and informatics | VOL. 281
Mahanazuddin Syed, et. al.Mahanazuddin Syed ... Melody L Greer
27 May 2021
Studies in health technology and informatics | VOL. 281

A Survey of Arabic Named Entity Recognition and Classification
Khaled Shaalan
Computational Linguistics | VOL. 40
Khaled ShaalanKhaled Shaalan
01 Jun 2014
Computational Linguistics | VOL. 40

Vocabulary-Enhanced Named Entity Recognition and its Application on Distribution Network Maintenance
Yu Wang ... Xiong-Yong Jiang
Journal of Circuits, Systems and Computers | VOL. -
Yu Wang, et. al.Yu Wang ... Xiong-Yong Jiang
25 Nov 2024
Journal of Circuits, Systems and Computers | VOL. -

Enhancing the Performance of Telugu Named Entity Recognition Using Gazetteer Features
Saikiranmai Gorla ... Aruna Malapati
Information | VOL. 11
Saikiranmai Gorla, et. al.Saikiranmai Gorla ... Aruna Malapati
02 Feb 2020
Information | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

A Benchmark Evaluation of Multilingual Large Language Models for Arabic Cross-Lingual Named-Entity Recognition

Abstract

Published Version

Talk to us

Similar Papers

More From: Electronics