Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

Keikichi Hirose,Tatsuya Matsuda,Hiroya Hashimoto,Nobuaki Minematsu

doi:10.1109/mlsp.2011.6064596

Abstract

Frame-by-frame representation is not appropriate for prosodic features, which are tightly related to speech units spreading a wide time span, such as words, phrases and so on. This causes an inherit problem in fundamental frequency (F <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">0</inf> ) contour generation by HMM-based speech synthesis. A method is developed to modify F <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">0</inf> contours in the framework of a generation process model by referring to linguistic information of input text (word boundary and accent type). It takes F <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">0</inf> variances obtained through HMM-based speech synthesis into account during the process. Through a listening experiment on synthetic speech, the method is proved to generate better quality as compared to the HMM-based speech synthesis on average. Since the generation process model can clearly relate its commands and linguistic (and para-/non- linguistic) information, the method has an additional advantage; changing speech styles, and /or adding further information (such as emphasis) can be easily done through manipulating the commands.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Use of fundamental frequencies shaped by generation process model for HMM-based speech synthesis
Keikichi Hirose ... Hiroya Hashimoto
-
Keikichi Hirose, et. al.Keikichi Hirose ... Hiroya Hashimoto
01 Oct 2014
01 Oct 2014

Use of generation process model for synthesizing fundamental frequency contours in HMM-based speech synthesis
Keikichi Hirose ... Hiroya Hashimoto
-
Keikichi Hirose, et. al.Keikichi Hirose ... Hiroya Hashimoto
01 Oct 2012
01 Oct 2012

Use of Generation Process Model for Improved Control of Fundamental Frequency Contours in HMM-Based Speech Synthesis
Keikichi Hirose
-
Keikichi HiroseKeikichi Hirose
01 Jan 2015
01 Jan 2015

Control of fundamental frequency contours using the generation process model in HMM-based speech synthesis
Tetsuya Matsuda ... Nobuaki Minematsu
-
Tetsuya Matsuda, et. al.Tetsuya Matsuda ... Nobuaki Minematsu
01 Oct 2010
01 Oct 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

Abstract

Talk to us

Similar Papers