MizBERT: A Mizo BERT Model

Robert Lalramhluna,Sandeep Dash,Dr.Partha Pakray

doi:10.1145/3666003

Abstract

This research investigates the utilization of pre-trained BERT transformers within the context of the Mizo language. BERT, an abbreviation for Bidirectional Encoder Representations from Transformers, symbolizes Google’s forefront neural network approach to Natural Language Processing (NLP), renowned for its remarkable performance across various NLP tasks. However, its efficacy in handling low-resource languages such as Mizo remains largely unexplored. In this study, we introduce MizBERT , a specialized Mizo language model. Through extensive pre-training on a corpus collected from diverse online platforms, MizBERT has been tailored to accommodate the nuances of the Mizo language. Evaluation of MizBERT’s capabilities is conducted using two primary metrics: masked language modeling and perplexity, yielding scores of 76.12% and 3.2565, respectively. Additionally, its performance in a text classification task is examined. Results indicate that MizBERT outperforms both the Multilingual BERT model and the Support Vector Machine algorithm, achieving an accuracy of 98.92%. This underscores MizBERT’s proficiency in understanding and processing the intricacies inherent in the Mizo language.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MizBERT: A Mizo BERT Model

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Jun 26, 2024
License type: mit

Similar Papers

Learned Text Representation for Amharic Information Retrieval and Natural Language Processing
Tilahun Yeshambel ... Josiane Mothe
Information | VOL. 14
Tilahun Yeshambel, et. al.Tilahun Yeshambel ... Josiane Mothe
20 Mar 2023
Information | VOL. 14

German BERT Model for Legal Named Entity Recognition
Harshil Darji ... Michael Granitzer
-
Harshil Darji, et. al.Harshil Darji ... Michael Granitzer
01 Jan 2023
01 Jan 2023

Unmasking the Mask – Evaluating Social Biases in Masked Language Models
Masahiro Kaneko ... Danushka Bollegala
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Masahiro Kaneko, et. al.Masahiro Kaneko ... Danushka Bollegala
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Evaluating large language models for health-related text classification tasks with public social media data.
Yuting Guo ... Abeed Sarker
Journal of the American Medical Informatics Association : JAMIA | VOL. -
Yuting Guo, et. al.Yuting Guo ... Abeed Sarker
09 Aug 2024
Journal of the American Medical Informatics Association : JAMIA | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MizBERT: A Mizo BERT Model

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing