A Supervised Learning Approach to Monaural Segregation of Reverberant Speech

Zhaozhang Jin, Deliang Wang

doi:10.1109/tasl.2008.2010633

Abstract

A major source of signal degradation in real environments is room reverberation. Monaural speech segregation in reverberant environments is a particularly challenging problem. Although inverse filtering has been proposed to partially restore the harmonicity of reverberant speech before segregation, this approach is sensitive to specific source/receiver and room configurations. This paper proposes a supervised learning approach to monaural segregation of reverberant voiced speech, which learns to map from a set of pitch-based auditory features to a grouping cue encoding the posterior probability of a time-frequency (T-F) unit being target dominant given observed features. We devise a novel objective function for the learning process, which directly relates to the goal of maximizing signal-to-noise ratio. The models trained using this objective function yield significantly better T-F unit labeling. A segmentation and grouping framework is utilized to form reliable segments under reverberant conditions and organize them into streams. Systematic evaluations show that our approach produces very promising results under various reverberant conditions and generalizes well to new utterances and new speakers.
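
The link between T-F unit labeling and signal-to-noise ratio mentioned in the abstract can be illustrated with a small sketch. This is not the authors' objective function or system. It assumes a toy setting in which each T-F unit has a known target and interference energy, a unit is labeled target dominant when its posterior exceeds 0.5, and SNR is approximated at the unit level by treating the ideal binary mask as the reference. The function names, the threshold, and the energy values are all hypothetical.

```python
import math


def label_units(posteriors, threshold=0.5):
    """Label a T-F unit 1 (target dominant) when its posterior exceeds the threshold.

    The 0.5 threshold is an assumption made for this sketch.
    """
    return [1 if p > threshold else 0 for p in posteriors]


def ideal_binary_mask(target_energy, interference_energy):
    """Reference labels: 1 where the target energy exceeds the interference energy."""
    return [1 if t > n else 0 for t, n in zip(target_energy, interference_energy)]


def mask_snr_db(mixture_energy, ideal_mask, estimated_mask):
    """Unit-level SNR proxy (an assumption, not the paper's waveform-level measure).

    Signal: mixture energy retained by the ideal mask.
    Noise: mixture energy in units where the estimated mask disagrees with it.
    """
    signal = sum(e * m for e, m in zip(mixture_energy, ideal_mask))
    noise = sum(
        e * (m - k) ** 2
        for e, m, k in zip(mixture_energy, ideal_mask, estimated_mask)
    )
    if noise == 0:
        return math.inf
    if signal == 0:
        return -math.inf
    return 10.0 * math.log10(signal / noise)


# Hypothetical energies for four T-F units.
target = [4.0, 0.5, 9.0, 0.1]
interference = [1.0, 2.0, 1.0, 0.4]
mixture = [t + n for t, n in zip(target, interference)]
ibm = ideal_binary_mask(target, interference)

# Two hypothetical posterior estimates, each mislabeling exactly one unit.
mask_a = label_units([0.9, 0.2, 0.4, 0.1])  # misses a high-energy target unit
mask_b = label_units([0.9, 0.2, 0.8, 0.7])  # falsely accepts a low-energy unit

snr_a = mask_snr_db(mixture, ibm, mask_a)
snr_b = mask_snr_db(mixture, ibm, mask_b)
print(f"one miss on a high-energy unit: {snr_a:.2f} dB")
print(f"one false alarm on a low-energy unit: {snr_b:.2f} dB")
```

Both estimated masks contain a single labeling error, yet the proxy SNR differs because the mislabeled units carry different energy. Under these assumptions, that is the kind of gap between counting errors and measuring SNR that an SNR-related training objective would account for.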

Journal: IEEE Transactions on Audio, Speech, and Language Processing
Publication Date: May 1, 2009
Citations: 135

Similar Papers

Monaural segregation of reverberant speech
Zhaozhang Jin ... Deliang Wang
The Journal of the Acoustical Society of America | VOL. 123
01 May 2008

Learning to maximize signal-to-noise ratio for reverberant speech segregation
Zhaozhang Jin ... Deliang Wang
01 Apr 2009

Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks
Yi Jiang ... Runsheng Liu
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
01 Dec 2014

Deep recurrent neural networks based binaural speech segregation for the selection of closest target of interest
R Venkatesan ... A Balaji Ganesh
Multimedia Tools and Applications | VOL. 77
02 Dec 2017
