Developing a Singlish Neural Language Model using ELECTRA

Galangkangin Gotera, Yugo Kartono Isal, Radityo Eko Prasojo

doi:10.1109/icacsis56558.2022.9923521

Abstract

We develop and benchmark a pretrained neural language model for Singlish. To this end, we build a novel 3 GB Singlish free-text dataset collected from various Singaporean websites. We then use ELECTRA (Efficiently Learning an Encoder that Classifies Token Replacements Accurately) to train a transformer-based Singlish language model. We choose ELECTRA for its resource efficiency, which helps make the work reproducible. We further build two Singlish text classification datasets: one for sentiment analysis and one for language identification. We use these two datasets to fine-tune our ELECTRA model and benchmark the results against other available pretrained models in English and Singlish. Our experiments show that our Singlish ELECTRA model is competitive with the best open-source models we found, despite being pretrained in significantly less time. We publicly release the benchmarking dataset.
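
As the expansion of its name suggests, ELECTRA pretrains an encoder to classify which tokens in a sequence have been replaced. The sketch below illustrates only how such per-token labels could be built. It is a toy under stated assumptions, not the authors' pipeline: real ELECTRA samples replacements from a small generator network, whereas here the hypothetical `corrupt_tokens` helper draws them uniformly at random from a made-up vocabulary, and the 15% replacement rate is likewise an illustrative choice.

```python
import random


def corrupt_tokens(tokens, vocab, replace_prob=0.15, seed=0):
    """Toy stand-in for ELECTRA's generator (assumption: random sampling).

    Returns the corrupted sequence and a 0/1 label per position,
    where 1 means "this token was replaced" and 0 means "original".
    """
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < replace_prob:
            new_tok = rng.choice(vocab)
            corrupted.append(new_tok)
            # If the sampled token happens to equal the original,
            # the position still counts as "original".
            labels.append(int(new_tok != tok))
        else:
            corrupted.append(tok)
            labels.append(0)
    return corrupted, labels


# Hypothetical example sentence and vocabulary, for illustration only.
sentence = ["wah", "this", "one", "very", "shiok", "lah"]
toy_vocab = ["can", "cannot", "makan", "lah", "leh", "very"]

corrupted, labels = corrupt_tokens(sentence, toy_vocab, replace_prob=0.5)
print(corrupted)
print(labels)
```

A discriminator trained on pairs like `(corrupted, labels)` receives a learning signal at every position rather than only at masked ones, which is the usual explanation for ELECTRA's efficiency.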

Similar Papers

Application of Transformer-Based Language Models to Detect Hate Speech in Social Media
Swapnanil Mukherjee ... Sujit Das
Journal of Computational and Cognitive Engineering | VOL. 2
17 Dec 2021

SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks
Wanyu Du ... Yangfeng Ji
01 Jan 2020