Speech-Signal-Based Frequency Warping

Kuldip Paliwal,Benjamin Shannon,Kamil Wojcicki,James Lyons

doi:10.1109/lsp.2009.2014096

Abstract

The speech signal is used for transmission of linguistic information. High energy portions of the speech spectrum have higher signal-to-noise ratios than the low energy portions. As a result, these regions are more robust to noise. Since the speech signal is known to be very robust to noise, it is expected that the high energy regions of the speech spectrum carry the majority of the linguistic information. This letter tries to derive a frequency warping function directly from the speech signal by sampling the frequency axis nonuniformly with the high energy regions sampled more densely than the low energy regions. To achieve this, an ensemble average short-time power spectrum is computed from a large speech corpus. The speech-signal-based frequency warping is obtained by considering equal area portions of the log spectrum. The proposed frequency warping is shown to be similar to the frequency scales obtained through psycho-acoustic experiments, namely the mel and bark scales. The warping is then used in filterbank design for automatic speech recognition experiments. The results of these experiments show that cepstral features based on the proposed warping achieve performance under clean conditions comparable to that of mel-frequency cepstral coefficients, while outperforming them under noisy conditions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech-Signal-Based Frequency Warping

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Journal: IEEE Signal Processing Letters	Publication Date: Apr 1, 2009
Citations: 33

Similar Papers

An Adaptive Method for Robust Detection of Vowels in Noisy Environment
Avinash Kumar ... Gayadhar Pradhan
Circuits, Systems, and Signal Processing | VOL. 38
Avinash Kumar, et. al.Avinash Kumar ... Gayadhar Pradhan
08 Feb 2019
Circuits, Systems, and Signal Processing | VOL. 38

A novel approach in feature level for robust text-independent speaker identification system
Susanta Kumar Sarangi ... Goutam Saha
-
Susanta Kumar Sarangi, et. al.Susanta Kumar Sarangi ... Goutam Saha
01 Dec 2012
01 Dec 2012

Directed crystallisation of zinc oxide on patterned surfaces
Dennis Palms ... Gerhard Wegner
Journal of Colloid And Interface Science | VOL. 303
Dennis Palms, et. al.Dennis Palms ... Gerhard Wegner
03 Aug 2006
Journal of Colloid And Interface Science | VOL. 303

Measurements of D/H Ratio Using Compact Neutral Particle Analyzer in LHD Deuterium Experiments
Tetsuo Ozaki ... Shuji Kamio
Plasma and Fusion Research | VOL. 15
Tetsuo Ozaki, et. al.Tetsuo Ozaki ... Shuji Kamio
08 Jun 2020
Plasma and Fusion Research | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech-Signal-Based Frequency Warping

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters