Speech enhancement by noise driven adaptation of perceptual scales and thresholds of continuous wavelet transform coefficients

Preety D Swami,Rupali Sharma,Alok Jain,Dhirendra K Swami

doi:10.1016/j.specom.2015.02.007

Abstract

This paper focuses on employing adaptive scales for computation of perceptually scaled continuous wavelet transform coefficients (CWT) and adaptive thresholding of these coefficients for speech enhancement. The adaptive scales and thresholds both were decided on the basis of the noise level of the noisy speech signal. The CWT coefficients were scaled perceptually and the proposed algorithm suggests selection of number of scales required for analysis on the basis of noise level. The CWT coefficients were then thresholded and for this a novel method of generating adaptive thresholds that too depends on the noise level of the noisy signal has also been proposed. Speech signals were acquired from the TIMIT database and evaluation of the proposed method is done by corrupting these signals by white Gaussian noise (at −10, −5, 0, 5, 10, 15 and 20dB SNRs) and four real world noises (each at 0dB SNR); pink, babble, car interior and F16 cockpit noise from the NOISEX-92 database. Enhancement results are compared on the basis of signal to noise ratio (SNR), segmental SNR (SSNR), spectral distortion (SD) and perceptual evaluation of speech quality (PESQ).Results of the proposed method are evaluated against Ephraim Malah filtering, Stein’s unbiased risk estimate (SURE) thresholding of bionic wavelet transform (BWT) coefficients (BWT-SURE), Wiener filtering (WF), perceptually scaled wavelet packet transform (PWT), multi-model WF and multi-model sparse code shrinkage (MultiSCS) enhancement methods. For the white Gaussian noise case, at all noise levels, SNR and SSNR of the proposed method were better than all the methods under comparison. SD and PESQ results were lower than multiSCS method at 10dB SNR but better at 15dB and 20dB SNRs. For the babble noise case, the obtained results were lower than Ephraim Malah but better than BWT-SURE. SNR and SSNR results for the cockpit noise were comparable with Ephraim Malah and BWT-SURE while for the pink noise case, the proposed method gives the best results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech enhancement by noise driven adaptation of perceptual scales and thresholds of continuous wavelet transform coefficients

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Mar 2, 2015
Citations: 20

Similar Papers

Multichannel MMSE Wiener Filter Using Complex Real and Imaginary Spectral Coefficients for Distributed Microphone Speech Enhancement
...
-
, et. al. ...
20 Dec 2016
20 Dec 2016

Singular Values Decomposition and Lifting Wavelet Transform for Speech Signal Embedding into Digital Image
Mourad Talbi ... Med Salim Bouhlel
Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) | VOL. 12
Mourad Talbi, et. al.Mourad Talbi ... Med Salim Bouhlel
28 Feb 2019
Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) | VOL. 12

Speech signal enhancement through adaptive wavelet thresholding
Michael T Johnson ... Yao Ren
Speech Communication | VOL. 49
Michael T Johnson, et. al.Michael T Johnson ... Yao Ren
19 Dec 2006
Speech Communication | VOL. 49

Overall performance evaluation of adaptive multi rate 06.90 speech codec based on code excited linear prediction algorithm using MATLAB
Ninad Bhatt ... Yogeshwar Kosta
International Journal of Speech Technology | VOL. 15
Ninad Bhatt, et. al.Ninad Bhatt ... Yogeshwar Kosta
12 Jan 2012
International Journal of Speech Technology | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech enhancement by noise driven adaptation of perceptual scales and thresholds of continuous wavelet transform coefficients

Abstract

Talk to us

Similar Papers

More From: Speech Communication