Recurrent Neural Networks for Cochannel Speech Separation in Reverberant Environments

Masood Delfarah,Deliang Wang

doi:10.1109/icassp.2018.8462014

Abstract

Speech separation is a fundamental problem in speech and signal processing. A particular challenge is monaural separation of cochannel speech, or a two-talker mixture, in a reverberant environment. In this paper, we study recurrent neural networks (RNNs) with long short-term memory (LSTM) in separating and enhancing speech signals in reverberant cochannel mixtures. Our investigation shows that RNNs are effective in separating reverberant speech signals. In addition, RNNs significantly outperform deep feedforward networks based on objective speech intelligibility and quality measures. We also find that the best performance is achieved when the ideal ratio mask (IRM) is used as the training target in comparison with alternative training targets. While trained using reverberant signals generated by simulated room impulse responses (RIRs), our model generalizes well to conditions where the signals are generated by recorded RIRs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recurrent Neural Networks for Cochannel Speech Separation in Reverberant Environments

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Dataset of directional room impulse responses for realistic speech data
Stefan Fragner ... Franz Pernkopf
Data in Brief | VOL. 53
Stefan Fragner, et. al.Stefan Fragner ... Franz Pernkopf
22 Feb 2024
Data in Brief | VOL. 53

Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation
Long Zhang ... Zhongfu Ye
Speech Communication | VOL. 97
Long Zhang, et. al.Long Zhang ... Zhongfu Ye
27 Dec 2017
Speech Communication | VOL. 97

A study on data augmentation of reverberant speech for robust speech recognition
Tom Ko ... Vijayaditya Peddinti
-
Tom Ko, et. al.Tom Ko ... Vijayaditya Peddinti
01 Mar 2017
01 Mar 2017

Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network
Alex Sherstinsky
Physica D: Nonlinear Phenomena | VOL. 404
Alex SherstinskyAlex Sherstinsky
21 Jan 2020
Physica D: Nonlinear Phenomena | VOL. 404

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recurrent Neural Networks for Cochannel Speech Separation in Reverberant Environments

Abstract

Talk to us

Similar Papers