Abstract

Nonnegative matrix factorization (NMF) is developed for parts-based representation of nonnegative signals with the sparseness constraint. The signals are adequately represented by a set of basis vectors and the corresponding weight parameters. NMF has been successfully applied for blind source separation and many other signal processing systems. Typically, controlling the degree of sparseness and characterizing the uncertainty of model parameters are two critical issues for model regularization using NMF. This paper presents the Bayesian group sparse learning for NMF and applies it for single-channel music source separation. This method reconstructs the rhythmic or repetitive signal from a common subspace spanned by the shared bases for the whole signal and simultaneously decodes the harmonic or residual signal from an individual subspace consisting of separate bases for different signal segments. A Laplacian scale mixture distribution is introduced for sparse coding given a sparseness control parameter. The relevance of basis vectors for reconstructing two groups of music signals is automatically determined. A Markov chain Monte Carlo procedure is presented to infer two sets of model parameters and hyperparameters through a sampling procedure based on the conditional posterior distributions. Experiments on separating single-channel audio signals into rhythmic and harmonic source signals show that the proposed method outperforms baseline NMF, Bayesian NMF, and other group-based NMF in terms of signal-to-interference ratio.

Highlights

  • Many problems in audio, speech and music processing can be tackled through matrix factorization

  • This paper presents a new Bayesian group sparse learning for Nonnegative matrix factorization (NMF) and applied it for single-channel music source separation

  • This distribution is shaped as a Laplacian scale mixture (LSM) distribution which is estimated from the 2nd segment of “music 2”. 4.3 Evaluation for single-channel music source separation A quantitative comparison over different NMFs is conducted by measuring signal-to-interference ratio (SIR) of reconstructed rhythmic signal and reconstructed harmonic signal

Read more

Summary

Introduction

Speech and music processing can be tackled through matrix factorization. Different cost functions and constraints may lead to different factorized matrices. This procedure can identify underlying sources from the mixed signals through blind source separation [1]. Nonnegative matrix factorization (NMF) and its extensions to different regularization functions are introduced. Bayesian learning methods for matrix factorization and other related tasks are introduced. Given a data matrix X = {Xik}, NMF estimates two factorized matrices A = {Aij} and S = {Sjk} by minimizing the reconstruction error between X and AS. In [2], the sparseness constraint was imposed on minimization of an objective function F which is based on a regularized error function

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.