Abstract

Unit-selection speech synthesis is one of the current corpus-based text-to-speech synthesis techniques. The quality of the generated speech depends on the accuracy of the unit selection process, which in turn relies on the definition of the cost function. This function should reflect the user's perceptual preferences when selecting synthesis units, which remains an open research issue. This paper proposes a complete methodology for tuning the cost function weights by fusing human judgments with the cost function through efficient and reliable interactive weight tuning. To that end, active interactive genetic algorithms (aiGAs) are used to guide the subjective weight adjustments. Applying aiGAs to this process mitigates user fatigue and frustration and improves user consistency. However, it is still unfeasible to subjectively adjust the weights for all the units in the corpus (diphones and triphones in this work), which makes unit clustering mandatory before conducting the tuning process. The aiGA-based weight tuning proposal is evaluated on a small speech corpus as a proof of concept and yields more natural synthetic speech than previous objective and subjective approaches.
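As context for the weight-tuning problem described above, the sketch below shows a conventional weighted-sum unit-selection cost whose weight vectors would be the tuning targets. This is a minimal illustration only: the sub-cost names, array shapes, and the weighted-sum form are assumptions for exposition, not the cost-function definition used in the paper.

```python
import numpy as np

# Hypothetical sub-cost names, used only to make the example concrete.
TARGET_SUBCOSTS = ["pitch", "duration", "energy"]          # target (unit-vs-specification) distances
CONCAT_SUBCOSTS = ["spectral_join", "f0_continuity"]       # concatenation (join) distances


def unit_selection_cost(target_dists, concat_dists, w_target, w_concat):
    """Weighted-sum cost of one candidate unit sequence.

    target_dists: (n_units, len(TARGET_SUBCOSTS)) distances to the target specification
    concat_dists: (n_units - 1, len(CONCAT_SUBCOSTS)) join distances between consecutive units
    w_target, w_concat: weight vectors; these are the quantities an aiGA would tune
    """
    tc = np.asarray(target_dists, dtype=float) @ np.asarray(w_target, dtype=float)
    cc = np.asarray(concat_dists, dtype=float) @ np.asarray(w_concat, dtype=float)
    return float(tc.sum() + cc.sum())
```

Under this assumed formulation, an interactive tuning loop would encode the weight vectors (w_target, w_concat) in each candidate solution, synthesize utterances with the corresponding costs, and let the listener's preference judgments drive selection in the genetic algorithm.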
