A Time-Domain Real-Valued Generalized Wiener Filter for Multi-Channel Neural Separation Systems

Yi Luo

doi:10.1109/taslp.2022.3205750

Abstract

Frequency-domain beamformers have been successful in a wide range of multi-channel neural separation systems in recent years. However, the operations in conventional frequency-domain beamformers are typically independently defined and complex-valued, which results in two drawbacks: the former does not fully exploit the advantage of end-to-end optimization, and the latter may introduce numerical instability during training. Motivated by the recent success of end-to-end neural separation systems, in this paper we propose the time-domain real-valued generalized Wiener filter (TD-GWF), a linear filter defined on a 2-D learnable real-valued signal transform. TD-GWF splits the transformed representation into groups and performs a minimum mean-square error (MMSE) estimation on all available channels for each of the groups. We show how TD-GWF can be connected to conventional filter-and-sum beamformers when a certain signal transform and number of groups are specified. Moreover, given the recent success of sequential neural beamforming frameworks, we show how TD-GWF can be applied in such frameworks to perform iterative beamforming and separation to obtain an overall performance gain. Comprehensive experimental results show that TD-GWF performs consistently better than conventional frequency-domain beamformers in the sequential neural beamforming pipeline across various neural network architectures, microphone array scenarios, and task configurations.
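The grouped MMSE step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's implementation. The tensor layout (channels, transform bins, frames), the function name `grouped_mmse_filter`, the diagonal loading term `eps`, and the use of a fixed target estimate in place of a network output are all assumptions made for illustration. The sketch splits the bins of a real-valued representation into groups, stacks all channels within each group, and solves a real-valued least-squares (Wiener) problem per group.

```python
import numpy as np


def grouped_mmse_filter(mixture, target, num_groups, eps=1e-6):
    """Hypothetical grouped real-valued MMSE filter (illustrative only).

    mixture: (C, N, T) real array, C channels, N transform bins, T frames.
    target:  (N, T) real array, the signal estimate the filter should match.
    Returns an (N, T) array with the filtered output.
    """
    num_channels, num_bins, num_frames = mixture.shape
    assert num_bins % num_groups == 0, "bins must divide evenly into groups"
    group_size = num_bins // num_groups

    output = np.empty(target.shape, dtype=float)
    for g in range(num_groups):
        bins = slice(g * group_size, (g + 1) * group_size)
        # Stack all channels of this group into one observation matrix.
        y = mixture[:, bins, :].reshape(num_channels * group_size, num_frames)
        x = target[bins, :]

        # Real-valued correlation matrices, with diagonal loading (assumed)
        # to keep the linear solve well conditioned.
        r_yy = y @ y.T / num_frames + eps * np.eye(y.shape[0])
        r_xy = x @ y.T / num_frames

        # MMSE solution W = R_xy R_yy^{-1}; R_yy is symmetric, so solve
        # R_yy Z = R_xy^T and transpose the result.
        w = np.linalg.solve(r_yy, r_xy.T).T
        output[bins, :] = w @ y
    return output


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    mix = rng.standard_normal((2, 8, 200))   # 2 channels, 8 bins, 200 frames
    ref = 0.5 * mix[0] - 0.25 * mix[1]       # target is a linear mix of channels
    est = grouped_mmse_filter(mix, ref, num_groups=4)
    print(np.max(np.abs(est - ref)))
```

Under these assumptions, a single group that spans all bins yields one joint filter over the whole representation, while more groups yield smaller independent filters. Because every quantity is real-valued, no complex matrix inversion is involved.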

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publication Date: Jan 1, 2022
Citations: 11