The Deterministic Plus Stochastic Model of the Residual Signal and Its Applications

Thomas Drugman, Thierry Dutoit

doi:10.1109/tasl.2011.2169787

Abstract

The modeling of speech production often relies on a source-filter approach. Although methods parameterizing the filter have nowadays reached a certain maturity, there is still a lot to be gained for several speech processing applications in finding an appropriate excitation model. This manuscript presents a Deterministic plus Stochastic Model (DSM) of the residual signal. The DSM consists of two contributions acting in two distinct spectral bands delimited by a maximum voiced frequency. Both components are extracted from an analysis performed on a speaker-dependent dataset of pitch-synchronous residual frames. The deterministic part models the low-frequency contents and arises from an orthonormal decomposition of these frames. As for the stochastic component, it is a high-frequency noise modulated both in time and frequency. Some interesting phonetic and computational properties of the DSM are also highlighted. The applicability of the DSM in two fields of speech processing is then studied. First, it is shown that incorporating the DSM vocoder in HMM-based speech synthesis enhances the delivered quality. The proposed approach turns out to significantly outperform the traditional pulse excitation and provides a quality equivalent to STRAIGHT. In a second application, the potential of glottal signatures derived from the proposed DSM is investigated for speaker identification purposes. Interestingly, these signatures are shown to lead to better recognition rates than other glottal-based methods.
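
The two-band structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `dsm_components`, the use of an uncentered SVD as the orthonormal decomposition, the 4 kHz maximum voiced frequency, the triangular time envelope, the `noise_gain` value, and the synthetic input frames are all assumptions made for illustration. The frequency-dependent shaping of the noise is omitted, and the input frames are assumed to be already length-normalized.

```python
import numpy as np


def dsm_components(frames, fs=16000.0, fm=4000.0, noise_gain=0.1, seed=0):
    """Return (deterministic, stochastic, basis) for one excitation frame.

    frames: (n_frames, frame_len) pitch-synchronous residual frames, assumed
    to be resampled to a common length upstream (hypothetical preprocessing).
    fm: assumed maximum voiced frequency in Hz separating the two bands.
    """
    frames = np.asarray(frames, dtype=float)
    frame_len = frames.shape[1]

    # Orthonormal decomposition of the frame set (here: an uncentered SVD).
    # The rows of `basis` are orthonormal; the first one captures the
    # dominant waveform shape shared by the frames.
    _, _, basis = np.linalg.svd(frames, full_matrices=False)
    first = basis[0]
    # The SVD leaves the sign arbitrary; align it with the average frame.
    if np.dot(first, frames.mean(axis=0)) < 0:
        first = -first

    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    low_band = freqs <= fm

    # Deterministic part: keep only the content at or below fm.
    deterministic = np.fft.irfft(np.fft.rfft(first) * low_band, n=frame_len)

    # Stochastic part: white noise restricted to the band above fm, then
    # shaped in time by a triangular envelope (an assumed envelope shape).
    rng = np.random.default_rng(seed)
    noise = rng.standard_normal(frame_len)
    high_noise = np.fft.irfft(np.fft.rfft(noise) * ~low_band, n=frame_len)
    stochastic = noise_gain * np.bartlett(frame_len) * high_noise

    return deterministic, stochastic, basis


# Synthetic stand-in for a dataset of residual frames: one pulse plus noise.
rng = np.random.default_rng(1)
t = np.arange(64)
pulse = np.exp(-0.5 * ((t - 32) / 3.0) ** 2)
frames = pulse + 0.05 * rng.standard_normal((50, 64))

det, sto, basis = dsm_components(frames)
excitation = det + sto  # one pitch-synchronous excitation frame
```

In this sketch the deterministic waveform is fixed once the decomposition has been computed, while the noise can be redrawn for every synthesized frame by changing `seed`.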

Journal: IEEE Transactions on Audio, Speech, and Language Processing
Publication Date: Mar 1, 2012
Citations: 141

Similar Papers

The Handbook of Phonetic Sciences
William J Hardcastle
01 Jan 1998

A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis
Thomas Drugman ... Geoffrey Wilfart
06 Sep 2009

The NEF-SPA Approach as a Framework for Developing a Neurobiologically Inspired Spiking Neural Network Model for Speech Production.
Bernd J Kröger
Journal of integrative neuroscience | VOL. 22
16 Aug 2023

What is the relationship between phonological short-term memory and speech processing?
Charlotte Jacquemot ... Sophie K Scott
Trends in Cognitive Sciences | VOL. 10
25 Sep 2006
