Balanced Arabic corpus design for speech synthesis

Aissa Amrouche,Ahcène Abed,Khadidja Nesrine Boubakeur,Kamel Ferrat,Youssouf Bentrcia,Leila Falek

doi:10.1007/s10772-021-09846-8

Abstract

This paper aims to design and validate a phonetically balanced speech corpus for Arabic language. Designing and developing a rich and phonetically balanced corpus in optimal context is one of the key issues in building high quality of text-to-speech synthesis systems. The rich characteristic is in the sense that it must contain all the possible phonemes on the right and left context, whereas the balanced characteristic is in the sense that it respects the phonetic distribution in the language. We propose a new methodology for designing and implementing such corpus for speech synthesis purposes. The paper explains the whole creation process of this corpus, beginning with the design stage, corpus creation, recording phases, and finally the segmentation of the speech corpus. The speech corpus contains 202 sentences with 6174 phonemes. In order to validate the speech corpus, an Arabic speech synthesis system using Hidden Markov Models has been developed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Balanced Arabic corpus design for speech synthesis

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Apr 21, 2021
Citations: 8

Similar Papers

Development of Unit Selection Based Speech Synthesis System
Archana Balyan
SSRN Electronic Journal | VOL. -
Archana BalyanArchana Balyan
01 Jan 2018
SSRN Electronic Journal | VOL. -

Natural speaker-independent Arabic speech recognition system based on Hidden Markov Models using Sphinx tools
Mohammad A M Abushariah ... Othman O Khalifa
-
Mohammad A M Abushariah, et. al.Mohammad A M Abushariah ... Othman O Khalifa
01 May 2010
01 May 2010

Phonetically rich and balanced speech corpus for Arabic speaker-independent continuous automatic speech recognition systems
Mohammad A M Abushariah ... Raja N Ainon
-
Mohammad A M Abushariah, et. al.Mohammad A M Abushariah ... Raja N Ainon
01 May 2010
01 May 2010

Implementation of speech synthesis based on HMM using PADAS database
Krichi Mohamed Khalil ... Cherif Adnan
-
Krichi Mohamed Khalil, et. al.Krichi Mohamed Khalil ... Cherif Adnan
01 Mar 2015
01 Mar 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Balanced Arabic corpus design for speech synthesis

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology