A phase generation method for speech reconstruction from spectral envelope and pitch intervals

Hong-Goo Kang,Hong Kook Kim

doi:10.1109/icassp.2002.5743746

Abstract

In this paper, we propose a new speech reconstruction method from spectral envelope and pitch intervals, which is applicable to the network side of a distributed speech recognition system as a play-back function. The spectral envelope of speech is represented as a set of mel-frequency cepstral coefficients that is a well-known recognition parameter. First, a sinusoidal synthesis with a zero-phase model is used to obtain a pitch-based waveform. To enhance the naturalness of the speech we replace the zero phase information with pre-stored linear and random codebooks. The ultimate phase information is determined depending on the energy ratio between linear and random components. Unlike the classic low bit-rate speech coding, however, the energy ratio is estimated in the decoding stage from a time-frequency filter applied to the pitch-based synthesized signal. Thus, the phase information is not a feature parameter from the encoder side. The proposed phase generation method uses the knowledge that pitch variation is a main cause of the mixed characteristics in speech signals. An informal listening test verifies that the quality of the proposed method is much better than that of the synthetic quality.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A phase generation method for speech reconstruction from spectral envelope and pitch intervals

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A phase generation method for speech reconstruction from spectral envelope and pitch intervals
Hong-Goo Kang ... Hong Kook Kim
-
Hong-Goo Kang, et. al. Hong-Goo Kang ... Hong Kook Kim
01 Jan 2002
01 Jan 2002

Modeling formant dynamics in speech spectral envelopes
Alexandra Craciun ... Gokhan Sevkin
-
Alexandra Craciun, et. al.Alexandra Craciun ... Gokhan Sevkin
01 Aug 2017
01 Aug 2017

Relationship between physical characteristics and speaker individualities in speech spectral envelopes
Tatsuya Kitamura ... Masato Akagi
The Journal of the Acoustical Society of America | VOL. 100
Tatsuya Kitamura, et. al.Tatsuya Kitamura ... Masato Akagi
01 Oct 1996
The Journal of the Acoustical Society of America | VOL. 100

Glottal Spectral Separation for Speech Synthesis
Joao P Cabral ... Steve Renals
IEEE Journal of Selected Topics in Signal Processing | VOL. 8
Joao P Cabral, et. al.Joao P Cabral ... Steve Renals
01 Apr 2014
IEEE Journal of Selected Topics in Signal Processing | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A phase generation method for speech reconstruction from spectral envelope and pitch intervals

Abstract

Talk to us

Similar Papers