Deep recurrent neural networks based binaural speech segregation for the selection of closest target of interest

R Venkatesan,A Balaji Ganesh

doi:10.1007/s11042-017-5458-3

Abstract

An auditory attention model that consists of binaural source segregation and also full localization of a target speech signal in a multi-talker environment is presented. The joint acoustic features, such as monaural, binaural and direct to reverberant ratio (DRR) that are successfully incorporated into deep recurrent neural network (DRNN) based joint discriminative model for the speech source segregation process. The monaural and binaural features are extracted from binaural speech mixtures of two speakers by using mean Hilbert envelope coefficients (MHEC) and interaural time, and level differences, respectively. The performance of deep recurrent network based speech segregation is validated in terms of signal to interference, signal to distortion and signal to artifacts and compared with existing architectures, including deep neural network (DNN). The proposed system is observed and found to be more suitable than monaural speech segregation especially when the desired target and interfering sources are located at different positions. The study also proposes full localization of segregated speech source that created the possibility to select the desired speaker of interest from an input acoustic speech mixture in a reverberant environment. The developed system has the capability to handle binaural segregation problem in multi-source and reverberation conditions. The auditory attention model provides accurate information about speech sources even when the desired targets are located at 2 m and above with higher reverberation time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep recurrent neural networks based binaural speech segregation for the selection of closest target of interest

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Dec 2, 2017
Citations: 5

Similar Papers

Binaural Classification-Based Speech Segregation and Robust Speaker Recognition System
R Venkatesan ... A Balaji Ganesh
Circuits, Systems, and Signal Processing | VOL. 37
R Venkatesan, et. al.R Venkatesan ... A Balaji Ganesh
23 Nov 2017
Circuits, Systems, and Signal Processing | VOL. 37

A DNN parameter mask for the binaural reverberant speech segregation
Yi Jiang ... Chao Ma
-
Yi Jiang, et. al.Yi Jiang ... Chao Ma
01 Oct 2016
01 Oct 2016

A regression approach to binaural speech segregation via deep neural network
Nana Fan ... Jun Du
-
Nana Fan, et. al.Nana Fan ... Jun Du
01 Oct 2016
01 Oct 2016

Binaural deep neural network classification for reverberant speech segregation
Yi Jiang ... Deliang Wang
-
Yi Jiang, et. al.Yi Jiang ... Deliang Wang
14 Sep 2014
14 Sep 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep recurrent neural networks based binaural speech segregation for the selection of closest target of interest

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications