AlpSynth - concatenation-based speech synthesis for the Slovenian language

J. Z. Gros, M. Zganec, A. Mihelic, N. Pavesic, S. Gruden

doi:10.1109/elmar.2005.193680

Abstract

The paper focuses on the design and collection of a speech corpus of elemental speech units for AlpSynth, a corpus-driven Slovenian TTS system. We describe the design procedures for a new speech corpus: purpose definition, content selection, definition of recording conditions and requirements, and corpus segmentation and annotation. First, we describe and comment on the results of a frequency analysis of Slovenian allophone strings, performed on a large Slovenian input text that had been converted to allophones. Next, we present a method we designed for selecting a compact and efficient set of Slovenian sentences from a large text corpus, so as to minimize the size of the final representative speech corpus. The selected sentences cover all of the desired most frequent Slovenian quadphones, triphones and, subsequently, diphones. We then describe the recording sessions and recording conditions, followed by the corpus annotation process. Finally, we describe the archive structure of the spoken corpus and present information on its structure, content and size.
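
The abstract does not spell out how the frequency analysis or the sentence selection is carried out, so the following is only a minimal sketch of one common way to approach the problem: count allophone n-grams (diphones for n = 2, triphones for n = 3, quadphones for n = 4), keep the most frequent ones as targets, and then pick sentences greedily until the targets are covered. The greedy strategy, the function names (`ngrams`, `most_frequent_units`, `greedy_select`), and the representation of a sentence as a list of allophone symbols are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter


def ngrams(units, n):
    """Return all contiguous n-unit strings (as tuples) in one allophone sequence."""
    return [tuple(units[i:i + n]) for i in range(len(units) - n + 1)]


def most_frequent_units(sentences, n, top_k):
    """Count n-unit strings over all sentences and return the top_k as a set."""
    counts = Counter()
    for sentence in sentences:
        counts.update(ngrams(sentence, n))
    return {unit for unit, _ in counts.most_common(top_k)}


def greedy_select(sentences, targets, n):
    """Greedy cover (an assumed strategy, not taken from the paper).

    Repeatedly picks the sentence that covers the most still-uncovered
    target units. Returns the indices of the chosen sentences and the set
    of targets that no sentence could cover.
    """
    remaining = set(targets)
    covers = [set(ngrams(sentence, n)) for sentence in sentences]
    chosen = []
    while remaining:
        best = max(range(len(sentences)), key=lambda i: len(covers[i] & remaining))
        gain = covers[best] & remaining
        if not gain:  # nothing left in the corpus covers the remaining targets
            break
        chosen.append(best)
        remaining -= gain
    return chosen, remaining


if __name__ == "__main__":
    # Hypothetical toy corpus: each sentence is already converted to allophones.
    toy_corpus = [["a", "b", "c"], ["a", "b"], ["b", "c"], ["c", "d", "a", "b"]]
    diphone_targets = most_frequent_units(toy_corpus, n=2, top_k=4)
    print(greedy_select(toy_corpus, diphone_targets, n=2))
```

A greedy cover does not guarantee the smallest possible sentence set, but it is simple and usually yields a compact one; covering quadphones first and then falling back to triphones and diphones, as the abstract suggests, could be sketched by calling `greedy_select` once per unit length.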

