Arabic grapheme-to-phoneme conversion based on joint multi-gram model

El-Hadi Cherifi,Mhania Guerti

doi:10.1007/s10772-020-09779-8

Abstract

Grapheme-to-phoneme conversion (G2P) process—which is is a necessary part of text-to-speech (TTS) systems—aims to predict a sequence of phonemes from a sequence of graphemes. For most languages, this task is limited to concatenated segment pronunciations in the case of words, and concatenated pronunciations of words in the case of a statement. This approach, however, is not viable for some languages, such as the Arabic language, where transitions between sounds in the word and between words in the statement cause changes in their pronunciation according to several considerations depending on the orthographic, phonetic and phonological context. In this work, we propose an approach for Arabic G2P Conversion based on a probabilistic method: joint multi-gram model (JMM). In this approach, we do not need to explain all the G2P correspondence anomalies that we will detail in this paper, but all this knowledge will be included implicitly at the learning stage. We discuss the results and experiments of this method applied on a pronunciation dictionary of the most commonly used Arabic words, and on carefully chosen and annotated texts for continuous speech. The current results do not surpass the baseline system but point the way towards future innovations. Indeed, these results are quite satisfactory on the dictionary adopted for test and learning, with a score of just over 10% error rate on the transcription of phonemes (phoneme error rate), and on the corpus of continuous speech, with a score of just over 11% of PER.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Arabic grapheme-to-phoneme conversion based on joint multi-gram model

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Jan 2, 2021
Citations: 4

Similar Papers

Conditional Random Fields Applied to Arabic Orthographic-Phonetic Transcription
...
Archives of Acoustics | VOL. 46
, et. al. ...
06 Nov 2023
Archives of Acoustics | VOL. 46

Transformer Based Grapheme-to-Phoneme Conversion
Sevinj Yolchuyeva ... Géza Németh
-
Sevinj Yolchuyeva, et. al.Sevinj Yolchuyeva ... Géza Németh
15 Sep 2019
15 Sep 2019

Entropic Analysis of Garhwali Text
Manoj Kumar Riyal ... Rajeev Kumar Upadhyay
-
Manoj Kumar Riyal, et. al.Manoj Kumar Riyal ... Rajeev Kumar Upadhyay
20 Sep 2020
20 Sep 2020

Can continuous speech recognizers handle isolated speech?
Fil Alleva ... Li Jiang
Speech Communication | VOL. 26
Fil Alleva, et. al.Fil Alleva ... Li Jiang
01 Nov 1998
Speech Communication | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Arabic grapheme-to-phoneme conversion based on joint multi-gram model

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology