Flexible post-lexical processing for speech synthesis from a large unit selection database

Mark Beutnagel

doi:10.1121/1.427291

Abstract

Online unit selection from large speech databases provides an opportunity to essentially play back words, phrases, and even sentences which were included in a recorded corpus. This capability can be extremely useful for limited domains, e.g., application prompts. Without switching voices, such a synthesizer could integrate high-quality synthesis with near-perfect recorded material. However, traditional post-lexical processing (PLP) considers only the phoneme specifications and not the sequences which actually exist in the target database. Phonemes supplied by the dictionary are typically rewritten into a single sequence of phones with reduced vowels, flapped t’s, etc. Given the enormous variability of human speech, any single sequence is unlikely to match an entire phrase or prompt as spoken and labeled. This paper addresses the use of flexible PLP, allowing multiple transcription possibilities which are essentially equivalent, at least for the speaker in question. By building the equivalences from the specific dictionary used by the synthesizer and the detailed phonetic labeling of a specific voice database, longer regions of the database can be selected, reducing the number of concatenation points in ordinary synthesis and increasing the odds of selecting complete recorded phrases.
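The idea of flexible PLP can be illustrated with a minimal sketch. It is not the paper's implementation: the equivalence table, the phone symbols, and the function names `expand`, `longest_match`, and `best_variant` are all hypothetical, and the toy database is a single labeled utterance. The sketch expands a dictionary phoneme sequence into every transcription the equivalences allow, then prefers the variant that matches the longest contiguous stretch of the database.

```python
from itertools import product

# Hypothetical equivalence table (an assumption for illustration): each
# dictionary phoneme maps to phones treated as interchangeable for one speaker.
EQUIVALENTS = {
    "t": ("t", "dx"),    # plain or flapped t
    "ah": ("ah", "ax"),  # full or reduced vowel
}

def expand(phonemes):
    """Return every phone sequence permitted by the equivalence table."""
    options = [EQUIVALENTS.get(p, (p,)) for p in phonemes]
    return [list(seq) for seq in product(*options)]

def longest_match(variant, database):
    """Length of the longest prefix of `variant` found contiguously in any utterance."""
    best = 0
    for utterance in database:
        for start in range(len(utterance)):
            n = 0
            while (n < len(variant) and start + n < len(utterance)
                   and utterance[start + n] == variant[n]):
                n += 1
            best = max(best, n)
    return best

def best_variant(phonemes, database):
    """Pick the allowed transcription that matches the longest database stretch."""
    return max(expand(phonemes), key=lambda v: longest_match(v, database))

# Toy database: one utterance labeled with a flapped t.
database = [["b", "ah", "dx", "er"]]
print(best_variant(["b", "ah", "t", "er"], database))
```

A rigid single-sequence transcription with an unflapped t would match only the first two phones here, while the flexible expansion recovers the whole recorded word. Enumerating every variant grows exponentially with sequence length, so a real system would more plausibly search a lattice of alternatives; the exhaustive version is kept only for clarity.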

Journal: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Database research in transfusion medicine: The power of large numbers.
Steven Kleinman ... Simone A Glynn
Transfusion | VOL. 55
Steven Kleinman, et. al.Steven Kleinman ... Simone A Glynn
01 Jul 2015
Transfusion | VOL. 55

Unit selection in a concatenative speech synthesis system using a large speech database
A.J Hunt ... A.W Black
-
A.J Hunt, et. al.A.J Hunt ... A.W Black
07 May 1996
07 May 1996

Unit selection in concatenative TTS synthesis systems based on mel filter bank amplitudes and phonetic context
T Lambert ... Stephen J Cox
-
T Lambert, et. al.T Lambert ... Stephen J Cox
01 Sep 2003
01 Sep 2003

Quantitative method for modeling context in concatenative synthesis using large speech database
W Hamza ... M Afify
-
W Hamza, et. al.W Hamza ... M Afify
07 May 2001
07 May 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Flexible post-lexical processing for speech synthesis from a large unit selection database

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America