Bayesian group sparse learning for music source separation

Jen-Tzung Chien,Hsin-Lung Hsieh

doi:10.1186/1687-4722-2013-18

Abstract

Nonnegative matrix factorization (NMF) is developed for parts-based representation of nonnegative signals with the sparseness constraint. The signals are adequately represented by a set of basis vectors and the corresponding weight parameters. NMF has been successfully applied for blind source separation and many other signal processing systems. Typically, controlling the degree of sparseness and characterizing the uncertainty of model parameters are two critical issues for model regularization using NMF. This paper presents the Bayesian group sparse learning for NMF and applies it for single-channel music source separation. This method reconstructs the rhythmic or repetitive signal from a common subspace spanned by the shared bases for the whole signal and simultaneously decodes the harmonic or residual signal from an individual subspace consisting of separate bases for different signal segments. A Laplacian scale mixture distribution is introduced for sparse coding given a sparseness control parameter. The relevance of basis vectors for reconstructing two groups of music signals is automatically determined. A Markov chain Monte Carlo procedure is presented to infer two sets of model parameters and hyperparameters through a sampling procedure based on the conditional posterior distributions. Experiments on separating single-channel audio signals into rhythmic and harmonic source signals show that the proposed method outperforms baseline NMF, Bayesian NMF, and other group-based NMF in terms of signal-to-interference ratio.

Highlights

Many problems in audio, speech and music processing can be tackled through matrix factorization
This paper presents a new Bayesian group sparse learning for Nonnegative matrix factorization (NMF) and applied it for single-channel music source separation
This distribution is shaped as a Laplacian scale mixture (LSM) distribution which is estimated from the 2nd segment of “music 2”. 4.3 Evaluation for single-channel music source separation A quantitative comparison over different NMFs is conducted by measuring signal-to-interference ratio (SIR) of reconstructed rhythmic signal and reconstructed harmonic signal

Summary

Introduction

Speech and music processing can be tackled through matrix factorization. Different cost functions and constraints may lead to different factorized matrices. This procedure can identify underlying sources from the mixed signals through blind source separation [1]. Nonnegative matrix factorization (NMF) and its extensions to different regularization functions are introduced. Bayesian learning methods for matrix factorization and other related tasks are introduced. Given a data matrix X = {Xik}, NMF estimates two factorized matrices A = {Aij} and S = {Sjk} by minimizing the reconstruction error between X and AS. In [2], the sparseness constraint was imposed on minimization of an objective function F which is based on a regularized error function

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Audio, Speech, and Music Processing	Publication Date: Jul 5, 2013
Citations: 30	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Bayesian group sparse learning for music source separation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing

Lead the way for us

Similar Papers

Advances in Nonnegative Matrix and Tensor Factorization
A Cichocki ... P Smaragdis
Computational Intelligence and Neuroscience | VOL. 2008
A Cichocki, et. al.A Cichocki ... P Smaragdis
01 Jan 2008
Computational Intelligence and Neuroscience | VOL. 2008

Speech Denoising Using Bayesian NMF with Online Base Update
Weili Zhou ... Zhen Zhu
-
Weili Zhou, et. al.Weili Zhou ... Zhen Zhu
01 May 2018
01 May 2018

Interpretable machine learning model using Extreme Gradient Boosting with Nonnegative Matrix Factorization improves the accuracy of arrhythmic risk prediction in Brugada Syndrome
G Tse ... K H C Li
European Heart Journal | VOL. 44
G Tse, et. al.G Tse ... K H C Li
09 Nov 2023
European Heart Journal | VOL. 44

Hidden Markov models as priors for regularized nonnegative matrix factorization in single-channel source separation
Emad M Grais ... Hakan Erdogan
-
Emad M Grais, et. al.Emad M Grais ... Hakan Erdogan
09 Sep 2012
09 Sep 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bayesian group sparse learning for music source separation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Audio, Speech, and Music Processing