Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation

Tom Barker,Tuomas Virtanen

doi:10.1109/ijcnn.2014.6889522

Abstract

This paper details the use of a semi-supervised approach to audio source separation. Where only a single source model is available, the model for an unknown source must be estimated. A mixture signal is separated through factorisation of a feature-tensor representation, based on the modulation spectrogram. Harmonically related components tend to modulate in a similar fashion, and this redundancy of patterns can be isolated. This feature representation requires fewer parameters than spectrally based methods and so minimises overfitting. Following the tensor factorisation, the separated signals are reconstructed by learning appropriate Wiener-filter spectral parameters which have been constrained by activation parameters learned in the first stage. Strong results were obtained for two-speaker mixtures where source separation performance exceeded those used as benchmarks. Specifically, the proposed semi-supervised method outperformed both semi-supervised non-negative matrix factorisation and blind non-negative modulation spectrum tensor factorisation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Advances in Nonnegative Matrix and Tensor Factorization
A Cichocki ... M Mørup
Computational Intelligence and Neuroscience | VOL. 2008
A Cichocki, et. al.A Cichocki ... M Mørup
01 Jan 2008
Computational Intelligence and Neuroscience | VOL. 2008

Non-negative Multiple Tensor Factorization
Koh Takeuchi ... Katsuhiko Ishiguro
-
Koh Takeuchi, et. al.Koh Takeuchi ... Katsuhiko Ishiguro
01 Dec 2013
01 Dec 2013

Fast Local Algorithms for Large Scale Nonnegative Matrix and Tensor Factorizations
Andrzej Cichocki ... Anh-Huy Phan
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences | VOL. E92-A
Andrzej Cichocki, et. al.Andrzej Cichocki ... Anh-Huy Phan
01 Jan 2009
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences | VOL. E92-A

Automatic allocation of NTF components for user-guided audio source separation
Cagdas Bilen ... Patrick Perez
-
Cagdas Bilen, et. al.Cagdas Bilen ... Patrick Perez
20 Jan 2016
20 Jan 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semi-supervised non-negative tensor factorisation of modulation spectrograms for monaural speech separation

Abstract

Talk to us

Similar Papers