Abstract
This letter describes a time-varying extension of independent vector analysis (IVA) based on the normalizing flow (NF), called NF-IVA, for determined blind source separation of multichannel audio signals. As in IVA, NF-IVA estimates demixing matrices that transform mixture spectra to source spectra in the complex-valued spatial domain such that the likelihood of those matrices for the mixture spectra is maximized under some non-Gaussian source model. While IVA performs a time-invariant bijective linear transformation, NF-IVA performs a series of time-varying bijective linear transformations (flow blocks) adaptively predicted by neural networks. To regularize such transformations, we introduce a soft volume-preserving (VP) constraint. Given mixture spectra, the parameters of NF-IVA are optimized by gradient descent with backpropagation in an unsupervised manner. Experimental results show that NF-IVA successfully performs speech separation in reverberant environments with different numbers of speakers and microphones and that NF-IVA with the VP constraint outperforms NF-IVA without it, standard IVA with iterative projection, and improved IVA with gradient descent.
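To make the demixing formulation in the abstract concrete, here is a rough sketch in generic IVA notation; the symbols, the exact scaling constants, and the form of the source prior are illustrative and may differ from the paper's own notation.

```latex
% Illustrative sketch in generic IVA notation (symbols may differ from the paper's).
% Standard IVA: one time-invariant demixing matrix W_f per frequency bin f,
% estimated by maximizing the log-likelihood of the mixture under a
% non-Gaussian source prior (constants and scaling conventions omitted).
\[
  \mathbf{y}_{ft} = \mathbf{W}_f \mathbf{x}_{ft}, \qquad
  \mathcal{L}\bigl(\{\mathbf{W}_f\}\bigr)
    = \sum_{t}\sum_{n} \log p(\mathbf{y}_{nt})
    + T \sum_{f} \log \bigl|\det \mathbf{W}_f\bigr|^{2},
\]
% where x_{ft} is the M-channel mixture at frequency f and frame t, and
% y_{nt} collects the n-th separated source across all frequencies at frame t.
% NF-IVA, as described in the abstract, replaces the single time-invariant
% transform with a series of K time-varying transforms (flow blocks), each
% predicted by a neural network:
\[
  \mathbf{y}_{ft}
    = \mathbf{W}^{(K)}_{ft} \cdots \mathbf{W}^{(1)}_{ft}\, \mathbf{x}_{ft},
\]
% with a soft volume-preserving penalty encouraging |det W^{(k)}_{ft}| ~ 1.
```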
Highlights
The widespread use of devices equipped with many microphones, e.g., smart speakers and smartphones, demands audio source separation methods that can effectively exploit the spatial information captured in multichannel recordings [1], [2].
We show that standard independent vector analysis (IVA) can be interpreted as a simple normalizing flow (NF) with a single flow step and extended to a general NF-IVA based on a more expressive NF with a series of flow steps.
As a generalization of IVA, we propose NF-IVA, which uses more than one flow step grouped into flow blocks performing time-varying transformations (see the sketch below).
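The following minimal PyTorch sketch illustrates the flow-block structure described in the highlights. It is a structural illustration, not the authors' implementation: the names FlowBlock and nf_iva_loss, the network architecture, the log-magnitude input features, the input layout (frequencies x frames x microphones), and the exact form of the volume-preserving penalty are all assumptions made for this example.

```python
# Minimal structural sketch of NF-IVA-style flow blocks (illustrative only).
# Assumes a complex STFT tensor x of shape (F, T, M): frequencies x frames x mics.
import torch
import torch.nn as nn


class FlowBlock(nn.Module):
    """Predicts a per-frame, per-frequency demixing matrix from its input signal."""

    def __init__(self, n_freq: int, n_mic: int, hidden: int = 128):
        super().__init__()
        self.n_mic = n_mic
        # Small network: log-magnitude features -> real/imag entries of the matrix.
        self.net = nn.Sequential(
            nn.Linear(n_freq * n_mic, hidden), nn.ReLU(),
            nn.Linear(hidden, n_freq * n_mic * n_mic * 2),
        )

    def forward(self, y: torch.Tensor):
        # y: complex tensor of shape (F, T, M)
        F, T, M = y.shape
        feats = torch.log1p(y.abs()).permute(1, 0, 2).reshape(T, F * M)   # (T, F*M)
        w = self.net(feats).reshape(T, F, M, M, 2)
        W = torch.complex(w[..., 0], w[..., 1]).permute(1, 0, 2, 3)       # (F, T, M, M)
        # Start near the identity so the transform is close to bijective early on.
        W = W * 0.1 + torch.eye(M, device=W.device).to(W.dtype)
        y_out = torch.einsum("ftmn,ftn->ftm", W, y)
        logdet = torch.log(torch.linalg.det(W).abs() + 1e-8)              # (F, T)
        return y_out, logdet


def nf_iva_loss(blocks, x: torch.Tensor, vp_weight: float = 1.0) -> torch.Tensor:
    """Negative log-likelihood sketch: non-Gaussian source contrast, log-det of
    the stacked transforms, and a soft volume-preserving (VP) penalty."""
    y, total_logdet = x, torch.zeros(x.shape[0], x.shape[1], device=x.device)
    for block in blocks:
        y, logdet = block(y)
        total_logdet = total_logdet + logdet
    # Spherical (Laplace-like) contrast: per-frame L2 norm of each output channel
    # across frequencies, a common non-Gaussian prior choice in IVA.
    contrast = y.abs().pow(2).sum(dim=0).sqrt().sum()
    nll = contrast - 2.0 * total_logdet.sum()          # factor 2: complex-valued Jacobian
    vp_penalty = vp_weight * total_logdet.pow(2).mean()  # softly encourage |det| ~ 1
    return nll + vp_penalty
```

In the spirit of the paper's unsupervised setting, a training loop would simply evaluate nf_iva_loss on the mixture itself and update the block parameters by gradient descent with backpropagation; no clean source references are required.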
Summary
The widespread use of devices equipped with many microphones, e.g., smart speakers and smartphones, demands audio source separation methods that can effectively exploit the spatial information captured in multichannel recordings [1], [2]. Such methods are useful for downstream applications, e.g., automatic speech recognition (ASR) and human listening. While supervised approaches based on deep neural networks (DNNs) [3]–[5] have been shown to work well, unsupervised separation techniques, a.k.a. blind source separation (BSS), are potentially better suited to handling unseen, unknown environments.