Representation Learning for Single-Channel Source Separation and Bandwidth Extension

Matthias Zohrer,Franz Pernkopf,Robert Peharz

doi:10.1109/taslp.2015.2470560

Abstract

In this paper, we use deep representation learning for model-based single-channel source separation (SCSS) and artificial bandwidth extension (ABE). Both tasks are ill-posed and source-specific prior knowledge is required. In addition to well-known generative models such as restricted Boltzmann machines and higher order contractive autoencoders two recently introduced deep models, namely generative stochastic networks (GSNs) and sum-product networks (SPNs), are used for learning spectrogram representations. For SCSS we evaluate the deep architectures on data of the 2 $^{\rm nd}$ CHiME speech separation challenge and provide results for a speaker dependent, a speaker independent, a matched noise condition and an unmatched noise condition task. GSNs obtain the best PESQ and overall perceptual score on average in all four tasks. Similarly, frame-wise GSNs are able to reconstruct the missing frequency bands in ABE best, measured in frequency-domain segmental SNR. They outperform SPNs embedded in hidden Markov models and the other representation models significantly.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Representation Learning for Single-Channel Source Separation and Bandwidth Extension

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Dec 1, 2015
Citations: 20

Similar Papers

On representation learning for artificial bandwidth extension
Matthias Zöhrer ... Robert Peharz
-
Matthias Zöhrer, et. al.Matthias Zöhrer ... Robert Peharz
06 Sep 2015
06 Sep 2015

Exploration of class specific ABWE for robust children's ASR under mismatched condition
Y Sunil ... R Sinha
-
Y Sunil, et. al.Y Sunil ... R Sinha
01 Jul 2012
01 Jul 2012

Representation models in single channel source separation
Matthias Zohrer ... Franz Pernkopf
-
Matthias Zohrer, et. al.Matthias Zohrer ... Franz Pernkopf
01 Apr 2015
01 Apr 2015

Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages
H Pulakka ... J Pohjalainen
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16
H Pulakka, et. al.H Pulakka ... J Pohjalainen
01 Aug 2008
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Representation Learning for Single-Channel Source Separation and Bandwidth Extension

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing