Single Channel Speech Enhancement Using Adaptive Soft-Thresholding with Bivariate EMD

Md Ekramul Hamid,Takayoshi Nakai,Xin Dang,Md Khademul Islam Molla

doi:10.1155/2013/724378

Md Ekramul Hamid, Takayoshi Nakai + Show 2 more

Open Access

https://doi.org/10.1155/2013/724378

Copy DOI

Abstract

This paper presents a novel data adaptive thresholding approach to single channel speech enhancement. The noisy speech signal and fractional Gaussian noise (fGn) are combined to produce the complex signal. The fGn is generated using the noise variance roughly estimated from the noisy speech signal. Bivariate empirical mode decomposition (bEMD) is employed to decompose the complex signal into a finite number of complex-valued intrinsic mode functions (IMFs). The real and imaginary parts of the IMFs represent the IMFs of observed speech and fGn, respectively. Each IMF is divided into short time frames for local processing. The variance of IMF of fGn calculated within a frame is used as the reference term to classify corresponding noisy speech frame into noise and signal dominant frames. Only the noise dominant frames are soft-thresholded to reduce the noise effects. Then, all the frames as well as IMFs of speech are combined, yielding the enhanced speech signal. The experimental results show the improved performance of the proposed algorithm compared to the recently reported methods.

Highlights

The research on speech enhancement is motivated by the rapidly growing market of speech communication applications, such as teleconferencing, hands-free telephony, hearing-aids, and speech recognition
The human auditory system is remarkably robust in most adverse situations, noise effects heavily affect the performance of automatic speech recognition (ASR) systems
Its main drawback is to find the speechless part to determine the noise variance. The performance of this method depends on the efficiency of voice activity detection (VAD), and it is not convenient to implement for practical applications

Summary

Introduction

The research on speech enhancement is motivated by the rapidly growing market of speech communication applications, such as teleconferencing, hands-free telephony, hearing-aids, and speech recognition. Its basic requirement is the noise spectrum which is determined from the nonspeech segments [3] In such single channel speech enhancement system, the residual noise is a usual issue. Instead of the speech signal, the variance of each IMF is used to determine the adaptive threshold, and better performance is achieved in [5]. Its main drawback is to find the speechless part to determine the noise variance The performance of this method depends on the efficiency of voice activity detection (VAD), and it is not convenient to implement for practical applications. This paper is organized as follows: the application of bEMD on speech and noise signals are described, the noise variance estimation process is explained, the proposed speech enhancement method using bEMD is described, experimental results are illustrated, and Section 6 contains some concluding remarks This paper is organized as follows: the application of bEMD on speech and noise signals are described in Section 2, the noise variance estimation process is explained in Section 3, the proposed speech enhancement method using bEMD is described in Section 4, experimental results are illustrated in Section 5, and Section 6 contains some concluding remarks

BEMD of Speech and Reference Signals

Estimation of Noise Variance

Speech Enhancement Method

Experimental Results and Discussions

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ISRN Signal Processing	Publication Date: Jul 31, 2013
Citations: 20	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

Single Channel Speech Enhancement Using Adaptive Soft-Thresholding with Bivariate EMD

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISRN Signal Processing

Lead the way for us

Similar Papers

Bivariate EMD‐Based Data Adaptive Approach to the Analysis of Climate Variability
Md Khademul Islam Molla ... M De La Sen
Discrete Dynamics in Nature and Society | VOL. 2011
Md Khademul Islam Molla, et. al.Md Khademul Islam Molla ... M De La Sen
01 Jan 2010
Discrete Dynamics in Nature and Society | VOL. 2011

Pitch estimation of noisy speech signals using EMD-fourier based hybrid algorithm
Sujan Kumar Roy ... Keikichi Hirose
-
Sujan Kumar Roy, et. al.Sujan Kumar Roy ... Keikichi Hirose
01 May 2010
01 May 2010

Voiced/non-voiced speech classification using adaptive thresholding with bivariate EMD
Md Khademul Islam Molla ... Keikichi Hirose
Pattern Analysis and Applications | VOL. 19
Md Khademul Islam Molla, et. al.Md Khademul Islam Molla ... Keikichi Hirose
25 Jan 2015
Pattern Analysis and Applications | VOL. 19

Instantaneous pitch estimation of noisy speech signal with multivariate SST
Md Khademul Islam Molla ... Mahboob Qaosar
-
Md Khademul Islam Molla, et. al.Md Khademul Islam Molla ... Mahboob Qaosar
01 May 2016
01 May 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Single Channel Speech Enhancement Using Adaptive Soft-Thresholding with Bivariate EMD

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ISRN Signal Processing