Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi

Abhishek Velankar,Raviraj Joshi,Hrushikesh Patil

doi:10.1007/978-3-031-20650-4_10

Abstract

Transformers are the most eminent architectures used for a vast range of Natural Language Processing tasks. These models are pre-trained over a large text corpus and are meant to serve state-of-the-art results over tasks like text classification. In this work, we conduct a comparative study between monolingual and multilingual BERT models. We focus on the Marathi language and evaluate the models on the datasets for hate speech detection, sentiment analysis and simple text classification in Marathi. We use standard multilingual models such as mBERT, indicBERT and xlm-RoBERTa and compare with MahaBERT, MahaALBERT and MahaRoBERTa, the monolingual models for Marathi. We further show that Marathi monolingual models outperform the multilingual BERT variants on five different downstream fine-tuning experiments. We also evaluate sentence embeddings from these models by freezing the BERT encoder layers. We show that monolingual MahaBERT based models provide rich representations as compared to sentence embeddings from multi-lingual counterparts. However, we observe that these embeddings are not generic enough and do not work well on out of domain social media datasets. We consider two Marathi hate speech datasets L3Cube-MahaHate, HASOC-2021, a Marathi sentiment classification dataset L3Cube-MahaSent, and Marathi Headline, Articles classification datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Toxic Comment Classification Model Based on Ensemble
Jian Xu ... Yuqing Zhai
Journal of Physics: Conference Series | VOL. 1873
Jian Xu, et. al.Jian Xu ... Yuqing Zhai
01 Apr 2021
Journal of Physics: Conference Series | VOL. 1873

Are the Multilingual Models Better? Improving Czech Sentiment with Transformers
Pavel PˇRib´AˇN ... Josef Steinberger
-
Pavel PˇRib´AˇN, et. al.Pavel PˇRib´AˇN ... Josef Steinberger
01 Jan 2020
01 Jan 2020

NASca and NASes: Two Monolingual Pre-Trained Models for Abstractive Summarization in Catalan and Spanish
Vicent Ahuir ... Encarna Segarra
Applied Sciences | VOL. 11
Vicent Ahuir, et. al.Vicent Ahuir ... Encarna Segarra
22 Oct 2021
Applied Sciences | VOL. 11

Improving sentence representation for vietnamese natural language understanding using optimal transport
Phu Xuan-Vinh Nguyen ... Kiet Van Nguyen
Journal of Intelligent & Fuzzy Systems | VOL. -
Phu Xuan-Vinh Nguyen, et. al.Phu Xuan-Vinh Nguyen ... Kiet Van Nguyen
27 Jun 2023
Journal of Intelligent & Fuzzy Systems | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi

Abstract

Talk to us

Similar Papers