Pali Speech Synthesis using HMM

Kittikan Charoenrattana,Pusadee Seresangtakul

doi:10.1109/kst51265.2021.9415759

Abstract

In this paper, we present a Pali (Thai) speech synthesis system using the parametric statistical approach. To develop the system, we recorded 40 Pali chants. Data were extracted and represented by the Mel frequency cepstral coefficients and fundamental frequency (F0), and labeled by force alignment. These parameters were modeled using the hidden Markov model (HMM). To generate synthesized speech, the input text was converted into context-dependent phonemes and generated speech parameters from the trained HMM model. The resulting parameters were used for synthesizing speech using a speech vocoder. In the study, we modeled two speech synthesized models: the first model represents tone in syllable levels (tone-syllable) and the second model represents tone in phoneme levels (tone-phoneme). To evaluate the naturalness of the proposed system, we asked 13 users to participate in listening tests comparing the two synthesized speech models (tone-syllable and tone-phoneme models) and original speech. The results, expressing naturalness in mean opinion score (MOS), were 4.21, 3.25, and 3.32 (from 5) for the original, tone-syllable, and tone-phoneme synthesized speeches, respectively. We also conducted an objective test in which we calculated the cepstral distance between the cepstral coefficients of the original speeches and synthesized speeches. The average distances were 3.67 and 3.60 for the tone-syllable and the tone-phoneme models, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Pali Speech Synthesis using HMM

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Non-intrusive objective speech quality assessment using a combination of MFCC, PLP and LSF features
Rajesh Kumar Dubey ... Arun Kumar
-
Rajesh Kumar Dubey, et. al.Rajesh Kumar Dubey ... Arun Kumar
01 Dec 2013
01 Dec 2013

An Isarn dialect HMM-based text-to-speech system
Pongsathon Janyoi ... Pusadee Seresangtakul
-
Pongsathon Janyoi, et. al.Pongsathon Janyoi ... Pusadee Seresangtakul
01 Nov 2017
01 Nov 2017

Exploration of vowel onset and offset points for hybrid speech segmentation
Biswajit Dev Sarma ... S Aswin Shanmugam
-
Biswajit Dev Sarma, et. al.Biswajit Dev Sarma ... S Aswin Shanmugam
01 Nov 2015
01 Nov 2015

Isarn Dialect Speech Synthesis using HMM with syllable-context features
Pongsathon Janyoi ... Pusadee Seresangtakul
ECTI Transactions on Computer and Information Technology (ECTI-CIT) | VOL. 12
Pongsathon Janyoi, et. al.Pongsathon Janyoi ... Pusadee Seresangtakul
29 Nov 2018
ECTI Transactions on Computer and Information Technology (ECTI-CIT) | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Pali Speech Synthesis using HMM

Abstract

Talk to us

Similar Papers