Abstract

The multichannel Wiener filter (MWF) is a well-established noise reduction technique for speech processing. Most commonly, the speech component in a selected reference microphone is estimated. The choice of this reference microphone influences the broadband output signal-to-noise ratio (SNR) as well as the speech distortion. Recently, a generalized formulation for the MWF (G-MWF) was proposed that uses a weighted sum of the individual transfer functions from the speaker to the microphones to form a better speech reference resulting in an improved broadband output SNR. For the MWF, the influence of the phase reference is often neglected, because it has no impact on the narrow-band output SNR. The G-MWF allows an arbitrary choice of the phase reference especially in the context of spatially distributed microphones. In this work, we demonstrate that the phase reference determines the overall transfer function and hence has an impact on both the speech distortion and the broadband output SNR. We propose two speech references that achieve a better signal-to-reverberation ratio (SRR) and an improvement in the broadband output SNR. Both proposed references are based on the phase of a delay-and-sum beamformer. Hence, the time-difference-of-arrival (TDOA) of the speech source is required to align the signals. The different techniques are compared in terms of SRR and SNR performance.

Highlights

  • Research on speech enhancement using socalled acoustic sensor networks consisting of spatially distributed microphones has gained significant interest [1,2,3,4,5,6,7,8,9,10,11,12]

  • We present a further generalization of the G-multichannel Wiener filter (MWF) approach in [7], which enables different phase references

  • We present a theoretical analysis of the broadband output signal-to-noise ratio (SNR) of the generalized formulation for the MWF (G-MWF)

Read more

Summary

Introduction

Research on speech enhancement using socalled acoustic sensor networks consisting of spatially distributed microphones has gained significant interest [1,2,3,4,5,6,7,8,9,10,11,12]. In [5], an MWF formulation with partial equalization (P-MWF) was presented, where the overall transfer function was chosen as the envelope of the individual ATFs with the phase of an arbitrary reference microphone. As shown in [19], the temporal smearing and the reverberation relies on the all-pass component of the overall transfer function This suggests that a suitable phase reference can improve the output SRR of the system.

Signal model and notation
MVDR beamformer
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.