ParsBERT: Transformer-based Model for Persian Language Understanding

Mehrdad Farahani,Mohammad Manthouri,Marzieh Farahani,Mohammad Gharachorloo

doi:10.1007/s11063-021-10528-4

Abstract

The surge of pre-trained language models has begun a new era in the field of Natural Language Processing (NLP) by allowing us to build powerful language models. Among these models, Transformer-based models such as BERT have become increasingly popular due to their state-of-the-art performance. However, these models are usually focused on English, leaving other languages to multilingual models with limited resources. This paper proposes a monolingual BERT for the Persian language (ParsBERT), which shows its state-of-the-art performance compared to other architectures and multilingual models. Also, since the amount of data available for NLP tasks in Persian is very restricted, a massive dataset for different NLP tasks as well as pre-training the model is composed. ParsBERT obtains higher scores in all datasets, including existing ones and gathered ones, and improves the state-of-the-art performance by outperforming both multilingual BERT and other prior works in Sentiment Analysis, Text Classification, and Named Entity Recognition tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ParsBERT: Transformer-based Model for Persian Language Understanding

Abstract

Talk to us

Similar Papers

More From: Neural Processing Letters

Lead the way for us

Journal: Neural Processing Letters	Publication Date: Oct 8, 2021
Citations: 72

Similar Papers

Evaluating Multilingual BERT for Estonian
Claudia Kittask ... Kirill Milintsevich
-
Claudia Kittask, et. al.Claudia Kittask ... Kirill Milintsevich
15 Sep 2020
15 Sep 2020

Bangla-BERT: Transformer-Based Efficient Model for Transfer Learning and Language Understanding
M Kowsher ... Mohammad Shamsul Arefin
IEEE Access | VOL. 10
M Kowsher, et. al.M Kowsher ... Mohammad Shamsul Arefin
01 Jan 2021
IEEE Access | VOL. 10

A Study of Vietnamese Sentiment Classification with Ensemble Pre-Trained Language Models
Dang Van Thin ... Duong Ngoc Hao
Vietnam Journal of Computer Science | VOL. 11
Dang Van Thin, et. al.Dang Van Thin ... Duong Ngoc Hao
07 Dec 2023
Vietnam Journal of Computer Science | VOL. 11

Keynote - AI for the Public Sector and the Case of Legal NLP
Matthias Stürmer
-
Matthias StürmerMatthias Stürmer
03 Apr 2023
03 Apr 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ParsBERT: Transformer-based Model for Persian Language Understanding

Abstract

Talk to us

Similar Papers

More From: Neural Processing Letters