Abstract

This article proposes an efficient median filter based algorithm to remove transient noise in a speech signal. The proposed algorithm adopts a modified long-term predictor (LTP) as the pre-processor of the noise reduction process to reduce speech distortion caused by the nonlinear nature of the median filter. This article shows that the LTP analysis does not modify to the characteristic of transient noise during the speech modeling process. Oppositely, if a short-term linear prediction (STP) filter is employed as a pre-processor, the enhanced output includes residual noise because the STP analysis and synthesis process keeps and restores transient noise components. To minimize residual noise and speech distortion after the transient noise reduction, a modified LTP method is proposed which estimates the characteristic of speech more accurately. By ignoring transient noise presence regions in the pitch lag detection step, the modified LTP successfully avoids being affected by transient noise. A backward pitch prediction algorithm is also adopted to reduce speech distortion in the onset regions. Experimental results verify that the proposed system efficiently eliminates transient noise while preserving desired speech signal.

Highlights

  • Reducing noise from noise-corrupted speech is essential for communication or recording devices

  • If we discard transient noise presence region during the pitch lag estimation process given in Eq (13), the residual noise in the enhanced speech becomes much smaller than the system with the conventional long-term predictor (LTP)

  • Both the segmental signalto-noise ratio (SSNR) and the log-spectral distance (LSD) are improved by utilizing the LTP with the modified pitch lag detector in Eq (13)

Read more

Summary

Introduction

Reducing noise from noise-corrupted speech is essential for communication or recording devices. If transient noise component exists in the search range, a transient noise segment in the current frame can be predicted by the other transient noise in the search range In such case, the LTP filter models the characteristic of the transient noise and brings residual noise in synthesized speech. The LTP filter models the characteristic of the transient noise and brings residual noise in synthesized speech Another problem of the conventional LTP method is that the LTP filter cannot preserve pitch information at the onset and the transition region of speech because a reference pitch does not exists. The modified LTP significantly reduces the residual noise in an enhanced signal and successfully reconstructs desired speech after the transient noise reduction

Residual after STP analysis Residual after LTP analysis
SNR SSNR LSD SNR SSNR LSD
LTP with with with
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call