Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

Ahmad Muammar Fanani,Suyanto Suyanto

doi:10.1016/j.procs.2021.01.058

Abstract

Abstract Syllabication or syllabification is an activity to detect syllable boundaries in a word. There are two main ways for automatic syllabification, namely rule-based and data-driven. The rule-based approach is based on the general principle of syllabification, while the data-driven method uses a set of syllabified words to create a syllabification of unknown words. Research on syllabification of words has been done a lot. However, most of these studies only deal with the formal words but still a few studies for named entities. Besides, named entities tend to be more complicated than the regular words. In this research, a syntactic n-Gram is proposed and investigated to syllabify the named entities since it is developed based on the n-gram that has an excellent accuracy and tends to be consistent with various languages. Evaluation on 20 k named-entities based on 4-fold cross-validation show that the proposed model gives a competitive syllable error rate (SER) compare to another similar n-gram-based model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Procedia Computer Science	Publication Date: Jan 1, 2021
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science

Lead the way for us

Similar Papers

Research on Syllable-Based Language Model in Malay Speech Recognition
Xiangfeng Wei ... Yi Yuan
-
Xiangfeng Wei, et. al.Xiangfeng Wei ... Yi Yuan
27 Oct 2022
27 Oct 2022

Indonesian graphemic syllabification using a nearest neighbour classifier and recovery procedure
Edwina Anky Parande ... Suyanto Suyanto
International Journal of Speech Technology | VOL. 22
Edwina Anky Parande, et. al.Edwina Anky Parande ... Suyanto Suyanto
08 Nov 2018
International Journal of Speech Technology | VOL. 22

Syllabification of English Words by Pashto Speakers
Shahabullah ... Arshad Ali Khan
Global Language Review | VOL. 5
Shahabullah, et. al. Shahabullah ... Arshad Ali Khan
30 Mar 2020
Global Language Review | VOL. 5

Indonesian Graphemic Syllabification Using n-Gram Tagger with State-Elimination
Rezza Nafi Ismail ... Suyanto Suyanto
-
Rezza Nafi Ismail, et. al.Rezza Nafi Ismail ... Suyanto Suyanto
01 Jun 2020
01 Jun 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Syllabification Model of Indonesian Language Named-Entity Using Syntactic n-Gram

Abstract

Talk to us

Similar Papers

More From: Procedia Computer Science