Use of generation process model for synthesizing fundamental frequency contours in HMM-based speech synthesis

Keikichi Hirose, Nobuaki Minematsu, Jun Ikeshima, Hiroya Hashimoto

doi:10.1109/icosp.2012.6491554

Abstract

The generation process model of fundamental frequency contours is well suited to representing global features of prosody. It is a command-response model whose commands relate clearly to the linguistic, paralinguistic, and non-linguistic information conveyed by the utterance. Handling fundamental frequency contours within the framework of this model therefore makes more flexible prosody control possible in speech synthesis. The model can also be used to address problems in HMM-based speech synthesis that arise from the frame-by-frame treatment of fundamental frequencies. Two approaches are possible: one applied before HMM training and one applied after generation. The former suppresses unnatural fundamental frequency movements in the speech used for HMM training; the latter reshapes the fundamental frequency contours generated by HMM-based speech synthesis. A prosody conversion method is also developed, which looks at the differences in model commands between the original and target styles. This method enables flexible control of fundamental frequency contours in speech synthesis.
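The abstract does not state the model equations. As a rough illustration of what a command-response model of this kind looks like, the sketch below uses the formulation commonly associated with the generation process model (often called the Fujisaki model): the logarithm of the fundamental frequency is a baseline plus the responses to impulse-like phrase commands and step-like accent commands. The constants `ALPHA`, `BETA`, and `GAMMA` are typical values assumed here, and all function names are hypothetical; none of them come from the paper.

```python
import math

# Assumed typical constants (not taken from the paper).
ALPHA = 3.0   # natural angular frequency of the phrase control mechanism (1/s)
BETA = 20.0   # natural angular frequency of the accent control mechanism (1/s)
GAMMA = 0.9   # ceiling of the accent component


def phrase_response(t, alpha=ALPHA):
    """Impulse response of the phrase control mechanism (zero before onset)."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=BETA, gamma=GAMMA):
    """Step response of the accent control mechanism, clipped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def log_f0(t, base_hz, phrase_cmds, accent_cmds):
    """Log fundamental frequency at time t (seconds).

    phrase_cmds: list of (onset_time, magnitude)
    accent_cmds: list of (onset_time, offset_time, amplitude)
    """
    value = math.log(base_hz)
    for onset, magnitude in phrase_cmds:
        value += magnitude * phrase_response(t - onset)
    for onset, offset, amplitude in accent_cmds:
        value += amplitude * (accent_response(t - onset) - accent_response(t - offset))
    return value


def f0_contour(times, base_hz, phrase_cmds, accent_cmds):
    """Fundamental frequency in Hz sampled at the given times."""
    return [math.exp(log_f0(t, base_hz, phrase_cmds, accent_cmds)) for t in times]


# Illustrative commands only: one phrase command and one accent command.
times = [i * 0.01 for i in range(200)]
contour = f0_contour(times, 100.0, [(0.0, 0.5)], [(0.3, 0.6, 0.4)])
```

Because the contour is fully determined by a handful of command timings and magnitudes, changing a command (for example, raising the accent amplitude from 0.4 to 0.8) reshapes the whole contour smoothly, which is the kind of control the abstract refers to.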

Similar Papers

Use of Generation Process Model for Improved Control of Fundamental Frequency Contours in HMM-Based Speech Synthesis
Keikichi Hirose
01 Jan 2015

Control of fundamental frequency contours using the generation process model in HMM-based speech synthesis
Tetsuya Matsuda ... Nobuaki Minematsu
01 Oct 2010

Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model
Keikichi Hirose ... Nobuaki Minematsu
01 Sep 2011

Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis
01 Jan 2015
