Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis

J Vepa,S King

doi:10.1109/tsa.2005.858548

Abstract

In unit selection-based concatenative speech synthesis, join cost (also known as concatenation cost), which measures how well two units can be joined together, is one of the main criteria for selecting appropriate units from the inventory. Usually, some form of local parameter smoothing is also needed to disguise the remaining discontinuities. This paper presents a subjective evaluation of three join cost functions and three smoothing methods. We also describe the design and performance of a listening test. The three join cost functions were taken from our previous study, where we proposed join cost functions derived from spectral distances, which have good correlations with perceptual scores obtained for a range of concatenation discontinuities. This evaluation allows us to further validate their ability to predict concatenation discontinuities. The units for synthesis stimuli are obtained from a state-of-the-art unit selection text-to-speech system: rVoice from Rhetorical Systems Ltd. In this paper, we report listeners' preferences for each join cost in combination with each smoothing method

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech and Language Processing	Publication Date: Sep 1, 2006
Citations: 46

Similar Papers

A data driven method for target and concatenation cost calculation with KL-Divergence in Mandarin hybrid speech synthesis
Shanfeng Liu ... Jianhua Tao
-
Shanfeng Liu, et. al.Shanfeng Liu ... Jianhua Tao
01 Oct 2014
01 Oct 2014

Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech
Xian-Jun Xia ... Li-Rong Dai
-
Xian-Jun Xia, et. al.Xian-Jun Xia ... Li-Rong Dai
01 Dec 2012
01 Dec 2012

DNN-based unit selection using frame-sized speech segments
Zhi-Ping Zhou ... Zhen-Hua Ling
-
Zhi-Ping Zhou, et. al.Zhi-Ping Zhou ... Zhen-Hua Ling
01 Oct 2016
01 Oct 2016

Maximum likelihood unit selection for corpus-based speech synthesis
Abubeker Gamboa Rosales ... Ruediger Hoffmann
-
Abubeker Gamboa Rosales, et. al.Abubeker Gamboa Rosales ... Ruediger Hoffmann
06 Sep 2009
06 Sep 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech and Language Processing