Fepstrum representation of speech signal

V Tyagi,C Wellekens

doi:10.1109/asru.2005.1566475

Abstract

Pole-zero spectral models in the frequency domain have been well studied and understood in the past several decades. Exploiting the duality between the temporal domain and the frequency domain, Kumaresan et al (R. Kumaresan, et al., March 1999), (R. Kumaresan, October 1998) have shown that the pole-zero model of the analytic speech signal in the temporal domain leads to its characterization in terms of the positive amplitude modulation (AM) and positive instantaneous frequency (PIF). In this paper, we carefully define AM and frequency modulation (FM) signals in the context of ASR. We show that for a theoretically meaningful estimation of the AM signal, it is necessary to decompose the speech signal into several narrow spectral bands as opposed to the previous use of the speech modulation spectrum (V. Tyagi, et al., 2003), (M. Athineos and D. Ellis, 2003), (M. Athineos, et al., April 2004), (Q. Zhu, and A. Alwan, 2000), (B. E. D. Kingsbury, et al., Aug. 1998), which was derived by decomposing the speech signal into increasingly wider spectral bands (such as critical, Bark or Mel). The estimated AM message signals are downsampled and their lower DCT coefficients are retained as speech features. These features carry information that is complementary to the MFCCs. A Tandem (H. Hermansky, 2003), (D. P. W. Ellis, et al., May 2001) combination of these two features is shown to improve recognition accuracy

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fepstrum representation of speech signal

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Fepstrum and Carrier Signal Decomposition of Speech Signals Through Homomorphic Filtering
V Tyagi ... C Wellekens
-
V Tyagi, et. al.V Tyagi ... C Wellekens
14 May 2006
14 May 2006

Novel speech processing techniques for robust automatic speech recognition

-

01 Jan 2006
01 Jan 2006

New pulse oximetry detection based on the light absorbance ratio as determined from amplitude modulation indexes in the time and frequency domains
Pattana Kainan ... Paramote Wardkein
Biomedical Signal Processing and Control | VOL. 75
Pattana Kainan, et. al.Pattana Kainan ... Paramote Wardkein
11 Mar 2022
Biomedical Signal Processing and Control | VOL. 75

Evaluation of Detection Response of an Electric Field Probe to AM Signals Using Equivalent Circuit Model
Ifong Wu ... Yasushi Matsumoto
-
Ifong Wu, et. al.Ifong Wu ... Yasushi Matsumoto
01 Sep 2019
01 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fepstrum representation of speech signal

Abstract

Talk to us

Similar Papers