Single channel source separation with general stochastic networks

Matthias Zöhrer,Franz Pernkopf

doi:10.21437/interspeech.2014-258

Single channel source separation with general stochastic networks

Matthias Zöhrer, Franz Pernkopf

https://doi.org/10.21437/interspeech.2014-258

Copy DOI

Publication Date: Sep 14, 2014

Citations: 2

Affiliation: Signal Processing (United States)

#Deep Neural Network Architecture #Single Channel Source Separation + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

Single channel source separation (SCSS) is ill-posed and thus challenging. In this paper, we apply general stochastic networks (GSNs) – a deep neural network architecture – to SCSS. We extend GSNs to be capable of predicting a time-frequency representation, i.e. softmask by introducing a hybrid generative-discriminative training objective to the network. We evaluate GSNs on data of the 2nd CHiME speech separation challenge. In particular, we provide results for a speaker dependent, a speaker independent, a matched noise condition and an unmatched noise condition task. Empirically, we compare to other deep architectures, namely a deep belief network (DBN) and a multi-layer perceptron (MLP). In general, deep architectures perform well on SCSS tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.