The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006

H. ZEN,K. TOKUDA,T. TODA

doi:10.1093/ietisy/e91-d.6.1764

Abstract

We describe a statistical parametric speech synthesis system developed by a joint group from the Nagoya Institute of Technology (Nitech) and the Nara Institute of Science and Technology (NAIST) for the annual open evaluation of text-to-speech synthesis systems named Blizzard Challenge 2006. To improve our 2005 system (Nitech-HTS 2005), we investigated new features such as mel-generalized cepstrum-based line spectral pairs (MGC-LSPs), maximum likelihood linear transform (MLLT), and a full covariance global variance (GV) probability density function (pdf). A combination of mel-cepstral coefficients, MLLT, and full covariance GV pdf scored highest in subjective listening tests, and the 2006 system performed significantly better than the 2005 system. The Blizzard Challenge 2006 evaluations show that Nitech-NAIST-HTS 2006 is competitive even when working with relatively large speech databases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEICE Transactions on Information and Systems	Publication Date: Jun 1, 2008
Citations: 74	License type: free

R Discovery Prime

R Discovery Prime

The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems

Lead the way for us

Similar Papers

Model Enforcement: A Unified Feature Transformation Framework for Classification and Recognition
M.K Omar ... M Hasegawa-Johnson
IEEE Transactions on Signal Processing | VOL. 52
M.K Omar, et. al.M.K Omar ... M Hasegawa-Johnson
01 Oct 2004
IEEE Transactions on Signal Processing | VOL. 52

Time-domain deterministic plus noise model based hybrid source modeling for statistical parametric speech synthesis
N.P Narendra ... K Sreenivasa Rao
Speech Communication | VOL. 77
N.P Narendra, et. al.N.P Narendra ... K Sreenivasa Rao
23 Dec 2015
Speech Communication | VOL. 77

Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system
Eunwoo Song ... Hong-Goo Kang
-
Eunwoo Song, et. al.Eunwoo Song ... Hong-Goo Kang
01 Apr 2015
01 Apr 2015

Constructing a Deep Neural Network Based Spectral Model for Statistical Speech Synthesis
Shinji Takaki ... Junichi Yamagishi
-
Shinji Takaki, et. al.Shinji Takaki ... Junichi Yamagishi
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Nitech-NAIST HMM-Based Speech Synthesis System for the Blizzard Challenge 2006

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems