Abstract

Phonemicization, or grapheme-to-phoneme conversion (G2P), is the process of converting a word into its pronunciation. It is an essential component of speech synthesis, speech recognition, and natural language processing. State-of-the-art deep learning (DL)-based G2P models generally achieve low phoneme error rates (PER) and word error rates (WER) for high-resource languages, such as English and other European languages, but not for low-resource languages. Conventional machine learning (ML)-based G2P models that incorporate language-specific linguistic knowledge are therefore preferable for low-resource languages. However, these models still perform poorly for several low-resource languages. For instance, an existing Indonesian G2P model works well for root words but yields a high PER for derivatives. Most errors stem from ambiguities in some roots and in derivative words containing one of four prefixes: 〈ber〉, 〈meng〉, 〈peng〉, and 〈ter〉. In this research, an Indonesian G2P model based on an n-gram model combined with a stemmer and phonotactic rules (NGTSP) is proposed to solve these problems. An evaluation based on 5-fold cross-validation over 50,000 Indonesian words shows that the proposed NGTSP achieves a much lower PER (0.78%) than the state-of-the-art Transformer-based G2P model (1.14%), while also providing a much faster processing time.
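To make the idea concrete, the following is a minimal, hypothetical Python sketch (not the authors' NGTSP implementation) of how a stemmer-style prefix check plus a simple phonotactic convention could fix the pronunciation of the four ambiguous prefixes before handing the stemmed root to an n-gram or dictionary back-end. The function names, toy root lexicon, and phoneme symbols below are illustrative assumptions, not taken from the paper.

```python
# Illustrative sketch only: prefix stripping + a fixed schwa reading for the
# four ambiguous prefixes, with the remaining root looked up separately.
# In the real model the root lookup would be an n-gram G2P component; here a
# tiny hand-written lexicon stands in for it (assumption).

AMBIGUOUS_PREFIXES = {
    # prefix -> its phoneme sequence; <e> inside these prefixes is the schwa /ə/
    "meng": ["m", "ə", "ŋ"],
    "peng": ["p", "ə", "ŋ"],
    "ber":  ["b", "ə", "r"],
    "ter":  ["t", "ə", "r"],
}

# Toy stand-in for the n-gram model over root words (assumption).
ROOT_PRONUNCIATIONS = {
    "ajar":   ["a", "dʒ", "a", "r"],
    "hitung": ["h", "i", "t", "u", "ŋ"],
}


def phonemize(word: str) -> list[str] | None:
    """Return a phoneme sequence for `word`, or None if the root is unknown."""
    for prefix, prefix_phones in AMBIGUOUS_PREFIXES.items():
        root = word[len(prefix):]
        if word.startswith(prefix) and root in ROOT_PRONUNCIATIONS:
            # Derivative: pronounce the prefix with /ə/, then the stemmed root.
            return prefix_phones + ROOT_PRONUNCIATIONS[root]
    return ROOT_PRONUNCIATIONS.get(word)  # plain root, no prefix stripped


if __name__ == "__main__":
    print(phonemize("mengajar"))   # ['m', 'ə', 'ŋ', 'a', 'dʒ', 'a', 'r']
    print(phonemize("terhitung"))  # ['t', 'ə', 'r', 'h', 'i', 't', 'u', 'ŋ']
```

In this toy version the stemmer and phonotactic step are collapsed into a single prefix table; the paper's contribution lies in combining such morphological knowledge with an n-gram model trained on the 50,000-word lexicon.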
