Abstract

In this paper, a new pipelined architecture of the multi-band spectral subtraction algorithm has been proposed for real-time speech enhancement. The proposed hardware has been implemented on field programmable gate array (FPGA) device using Xilinx system generator (XSG), high-level programming tool, and Nexys-4 development board. The multi-band algorithm has been developed to reduce the additive colored noise that does not uniformly affect the entire frequency band of useful signal. All the algorithm steps have been successfully implemented on hardware. Pipelining has been employed on this hardware architecture to increase the data throughput. Speech enhancement performances obtained by the hardware architecture are compared to those obtained by MATLAB simulation using simulated and actual noises. The resource utilization, the maximum operating frequency, and power consumption are reported for a low-cost Artix-7 FPGA device.

Highlights

  • The enhancement of speech corrupted by background noise represents a great challenge for real-word speech processing systems, such as speech recognition, speaker identification, voice coders, hand-free systems, and hearing aids

  • Comparison of the multi-band spectral subtraction algorithm to other speech enhancement methods has been evaluated in the literature

  • The enhancement performances of the proposed Xilinx system generator (XSG)-based architecture are compared to those obtained by MATLAB simulation using two objective tests: the overall signal-to-noise ratio and the segmental signal-to-noise ratio

Read more

Summary

Introduction

The enhancement of speech corrupted by background noise represents a great challenge for real-word speech processing systems, such as speech recognition, speaker identification, voice coders, hand-free systems, and hearing aids. Spectral subtraction method is a popular single-channel noise reduction algorithm that has been initially proposed for speech enhancement [1]. In real-world applications, such as hands-free communication kits, cellular phones and hearing aid devices, these speech enhancement techniques need to be executed in real-time Hardware implementation of this kind of algorithms is a difficult task that consists in finding a balance between complexity, efficiency and throughput of these algorithms. A new pipelined architecture of multi-band spectral subtraction method has been proposed for real-time speech enhancement. XC7A100T FPGA chip (Xilinx Inc, San Diego, CA, USA) Mathematical equations describing this speech enhancement algorithm (Fourier transform, signal power spectrum, noise power estimate, multi-band separation, signal-to-noise ratio, over-subtraction factor, spectral subtraction, multi-band merging, inverse Fourier transform, etc.) have been efficiently modeled using the XSG blockset.

Spectral Subtraction Methods
Basic Spectral Substraction
Generalized Spectral Subtraction
Multi-Band Spectral Subtraction
FPGA Implementation
Spectral Transformation and Noise Spectrum Estimation
Multi-Band Separation of Signal and Noise
Over-Subtraction Factor Calculation
Spectral Subtraction
Transformation Back to Time-Domain
Pipelining
Implementation Characteristics
Results and Discussion
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call