High Quality Arabic Concatenative Speech Synthesis

Abdelkader Chabchoub

doi:10.5121/sipij.2011.2403

Abstract

This paper describes the implementation of TD-PSOLA tools to improve the quality of an Arabic text-to-speech (TTS) system. The system is based on diphone concatenation with a TD-PSOLA modifier synthesizer. The paper presents techniques to improve the precision of prosodic modifications in Arabic speech synthesis using the TD-PSOLA (Time Domain Pitch Synchronous Overlap-Add) method. This approach is based on decomposing the signal into overlapping frames synchronized with the pitch period. The main objective is to preserve the consistency and accuracy of the pitch marks after prosodic modification of the speech signal, and to adjust and optimise a diphone database with integrated vowels.

Highlights

Producing a synthetic voice that imitates human speech from plain text is not a trivial task, since it generally requires extensive knowledge about the real world, the language, and the context the text comes from, as well as a deep understanding of the semantics of the text and the relations underlying all this information.
Several speech synthesis systems have been developed, such as vocoders and LPC synthesizers [5][6], but most of them did not produce synthetic speech of high quality compared with PSOLA-based systems [7] such as MBROLA synthesizers [8].
The test group consisted of sixteen persons, and the two previously mentioned tests were repeated twice to see whether the results would improve through the learning effect, that is, listeners becoming accustomed to the synthesized speech and understanding it better after every listening session.

Summary

INTRODUCTION

Producing a synthetic voice that imitates human speech from plain text is not a trivial task, since it generally requires extensive knowledge about the real world, the language, and the context the text comes from, as well as a deep understanding of the semantics of the text and the relations underlying all this information. Many research and commercial speech synthesis systems have contributed to our understanding of these phenomena and have been successful in applications such as human-machine interaction, hands-free and eyes-free access to information, and interactive voice response systems. The TD-PSOLA (Time Domain Pitch Synchronous Overlap-Add) method is the most efficient method for producing speech of satisfactory quality [9] and is one of the most popular concatenative synthesis techniques today. In TD-PSOLA, short-time signals (ST signals) are overlapped and added at the desired spacing.
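The overlap-add step can be illustrated with a minimal Python sketch. It is not the paper's implementation: the function name `td_psola`, the parameters `pitch_scale` and `time_scale`, the two-period Hann window, and the window-sum normalization are all assumptions made for illustration, and the pitch marks are assumed to be already available as sample indices.

```python
import numpy as np

def td_psola(x, marks, pitch_scale=1.0, time_scale=1.0):
    """Illustrative TD-PSOLA: overlap-add Hann-windowed, two-period frames.

    x           : 1-D speech signal
    marks       : strictly increasing pitch-mark positions (sample indices)
    pitch_scale : > 1 raises the pitch, < 1 lowers it (hypothetical parameter)
    time_scale  : > 1 lengthens the signal, < 1 shortens it (hypothetical parameter)
    """
    x = np.asarray(x, dtype=float)
    marks = np.asarray(marks, dtype=int)
    if len(marks) < 2 or np.any(np.diff(marks) <= 0):
        raise ValueError("need at least two strictly increasing pitch marks")
    if pitch_scale <= 0 or time_scale <= 0:
        raise ValueError("scale factors must be positive")

    periods = np.diff(marks)                 # local pitch periods in samples
    out_len = int(round(len(x) * time_scale))
    y = np.zeros(out_len)
    norm = np.zeros(out_len)                 # running sum of windows

    t_syn = marks[0] * time_scale            # first synthesis instant
    while t_syn < out_len:
        # Map the synthesis instant to the analysis axis, take the nearest mark.
        k = int(np.argmin(np.abs(marks - t_syn / time_scale)))
        p = int(periods[min(k, len(periods) - 1)])
        m, c = int(marks[k]), int(round(t_syn))
        # Copy the frame only if it fits inside both input and output.
        if m - p >= 0 and m + p < len(x) and c - p >= 0 and c + p < out_len:
            w = np.hanning(2 * p + 1)        # window spans two pitch periods
            y[c - p:c + p + 1] += w * x[m - p:m + p + 1]
            norm[c - p:c + p + 1] += w
        t_syn += p / pitch_scale             # new spacing sets the new pitch

    covered = norm > 1e-8
    y[covered] /= norm[covered]              # assumption: normalize by window sum
    return y

# Usage on a synthetic "voiced" signal: ten harmonics of 100 Hz at 8 kHz,
# so the pitch period is exactly 80 samples and marks are trivially known.
fs = 8000
n = np.arange(fs)
x = sum(np.sin(2 * np.pi * 100 * h * n / fs) / h for h in range(1, 11))
marks = np.arange(80, len(x) - 80, 80)

higher = td_psola(x, marks, pitch_scale=2.0)   # frames placed every 40 samples
slower = td_psola(x, marks, time_scale=1.5)    # frames repeated to stretch time
```

Placing frames closer together or further apart than the original period changes the pitch, while repeating or skipping frames changes the duration. Both operations depend on accurate pitch marks, since each frame is cut around a mark.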

Introduction for Arabic language

Database construction

Speech analysis

Speech marks

Reading marks

Synthesis marks

Synthesis speech

RESULTS AND EVALUATION

CONCLUSION

Journal: Signal & Image Processing : An International Journal
Publication Date: Dec 31, 2011
Citations: 1
License type: cc-by
