Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation

Jianyu Wang, Xiao-Lei Zhang, Shanzheng Guan, Shupei Liu

doi:10.1109/taslp.2021.3120603

Abstract

Multichannel blind audio source separation aims to recover the latent sources from their multichannel mixtures without supervised information. One state-of-the-art blind audio source separation method, independent low-rank matrix analysis (ILRMA), unifies independent vector analysis (IVA) and nonnegative matrix factorization (NMF). However, the spectral matrix produced by NMF may not yield a compact spectral basis, and it may not guarantee the identifiability of each source either. To address this problem, we propose to enhance the identifiability of the source model with a minimum-volume prior distribution. We further regularize multichannel NMF (MNMF) and ILRMA, respectively, with the minimum-volume regularizer. The proposed methods maximize the posterior distribution of the separated sources, which ensures the stability of the convergence. Experimental results demonstrate the effectiveness of the proposed methods compared with auxiliary independent vector analysis, MNMF, ILRMA and its extensions. The source code is available at https://github.com/alexwang9654/m-ILRMA.
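To make the idea of a minimum-volume regularizer concrete, the sketch below adds a log-determinant volume penalty on the spectral basis to an Itakura-Saito NMF cost. This is a generic illustration of a form commonly used in the minimum-volume NMF literature, not necessarily the prior or update rules used in the paper. The function names and the parameters `lam` and `delta` are assumptions introduced here for illustration.

```python
import numpy as np

def min_volume_penalty(W, delta=1.0):
    """log det(W^T W + delta * I), a common volume surrogate for a basis W (F x K).

    Assumption: this log-det form is a generic choice, not taken from the paper.
    """
    K = W.shape[1]
    gram = W.T @ W + delta * np.eye(K)
    _, logdet = np.linalg.slogdet(gram)  # gram is positive definite, so sign is +1
    return float(logdet)

def is_divergence(X, Y, eps=1e-12):
    """Itakura-Saito divergence between nonnegative arrays X and Y."""
    R = (X + eps) / (Y + eps)
    return float(np.sum(R - np.log(R) - 1.0))

def regularized_objective(X, W, H, lam=0.1, delta=1.0):
    """Data-fit term plus weighted volume penalty (hypothetical weighting lam)."""
    return is_divergence(X, W @ H) + lam * min_volume_penalty(W, delta)

# Two toy bases with unit-norm columns: identical columns versus orthogonal columns.
W_compact = np.array([[1.0, 1.0], [0.0, 0.0]])
W_spread = np.array([[1.0, 0.0], [0.0, 1.0]])
print(min_volume_penalty(W_compact), min_volume_penalty(W_spread))
```

With these toy bases, the one whose columns coincide receives the smaller penalty, which is the sense in which such a regularizer favors a compact spectral basis.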

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publication Date: Jan 1, 2021
Citations: 3

Similar Papers

Determined Blind Source Separation with Independent Low-Rank Matrix Analysis
Daichi Kitamura ... Hiroshi Sawada
01 Jan 2018

Convolutive Transfer Function-Based Multichannel Nonnegative Matrix Factorization for Overdetermined Blind Source Separation
Taihui Wang ... Jun Yang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
01 Jan 2021

Multi-channel Non-negative Matrix Factorization Initialized with Full-rank and Rank-1 Spatial Correlation Matrix for Speech Recognition
Yuuki Tachioka
01 Nov 2018

Independent Low-Rank Matrix Analysis Based on Multivariate Complex Exponential Power Distribution
Rintaro Ikeshita ... Yohei Kawaguchi
01 Apr 2018
