Diphone Databases for Lithuanian Text-to-Speech Synthesis

Pijus Kasparaitis

doi:10.15388/informatica.2005.093

Abstract

One of the components of the text-to-speech synthesis system is the database of sounds. Two Lithuanian diphone databases in the MBROLA format are presented in this paper. The list of phonemes and the list of diphones necessary for Lithuanian text-to-speech synthesis are described. The problem of phoneme combinations that are not used in the Lithuanian language is dealt with in the work. Also, the article is concerned with transcribing a Lithuanian text.

Highlights

In his previous articles (Kasparaitis, 2001a; Kasparaitis, 2001b) the present author described the Lithuanian text-to-speech synthesizer “Aistis” in which 480 phonetic units of various lengths were used: parts of phonemes, allophones, diphthongs and mixed diphthongs, most often consonants with the beginnings of vowels and allophones of vowels are made use of
8464 combinations may be built on the basis of 92 phonemes, it is impossible to have all the diphones made from these combinations in the Lithuanian language
The Project is aimed at creating a set of speech synthesizers and accumulating databases of speech sounds of as many languages as possible

Summary

Introduction

In his previous articles (Kasparaitis, 2001a; Kasparaitis, 2001b) the present author described the Lithuanian text-to-speech synthesizer “Aistis” in which 480 phonetic units of various lengths were used: parts of phonemes, allophones, diphthongs and mixed diphthongs, most often consonants with the beginnings of vowels and allophones of vowels are made use of. This method has some disadvantages, the most important one being the problems related to changing the length of sounds. The transition points between the sounds are stored in the database, the duration of sounds can be calculated exactly

List of Phonemes in Lithuanian

Naming Conventions of Sounds

List of Diphones in Lithuanian

Problem of Non-existent Diphones in Text-to-Speech Synthesis

List of Replacements of Diphones

Lithuanian Diphone Databases

The Aim of MBROLA Project

Operation of MBROLA Synthesizer

Lithuanian MBROLA Databases

10. Transcription

11. Application of Diphone Databases in Speech Synthesis

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Informatica	Publication Date: Jan 1, 2005
Citations: 15	License type: cc-by

R Discovery Prime

R Discovery Prime

Diphone Databases for Lithuanian Text-to-Speech Synthesis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Informatica

Lead the way for us

Similar Papers

WordNet Based Sindhi Text to Speech Synthesis System
Javed Ahmed Mahar ... Ghulam Qadir Memon
-
Javed Ahmed Mahar, et. al.Javed Ahmed Mahar ... Ghulam Qadir Memon
01 Jan 2009
01 Jan 2009

Russian Phonetic Variability and Connected Speech Transcription
Vladimir I Kuznetsov ... Tatiana Y Sherstinova
-
Vladimir I Kuznetsov, et. al.Vladimir I Kuznetsov ... Tatiana Y Sherstinova
01 Jan 1999
01 Jan 1999

The French language database: Defining, planning, and recording a large database
R Carre ... M Rossi
-
R Carre, et. al.R Carre ... M Rossi
19 Mar 1984
19 Mar 1984

Validation of the Acoustic Voice Quality Index in the Lithuanian Language
Virgilijus Uloza ... Youri Maryn
Journal of Voice | VOL. 31
Virgilijus Uloza, et. al.Virgilijus Uloza ... Youri Maryn
15 Jul 2016
Journal of Voice | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Diphone Databases for Lithuanian Text-to-Speech Synthesis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Informatica