A classification based approach to speech segregation

Kun Han, Deliang Wang

doi:10.1121/1.4754541

Abstract

A key problem in computational auditory scene analysis (CASA) is monaural speech segregation, which has proven to be very challenging. For monaural mixtures, one can only utilize the intrinsic properties of speech or interference to segregate target speech from background noise. The ideal binary mask (IBM) has been proposed as a main goal of sound segregation in CASA and has led to substantial improvements in human speech intelligibility in noise. This study proposes a classification approach to estimate the IBM and employs support vector machines to classify time-frequency units as either target- or interference-dominant. A re-thresholding method is incorporated to improve classification results and maximize the hit rate minus the false-alarm rate. An auditory segmentation stage is utilized to further improve the estimated masks. Systematic evaluations show that the proposed approach produces high-quality estimated IBMs and outperforms a recent system in terms of classification accuracy.
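The quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `lc_db` local SNR criterion, and the toy scores are all assumptions made for illustration. The sketch uses the common definitions, where the hit rate is the fraction of target-dominant units labeled as target and the false-alarm rate is the fraction of interference-dominant units labeled as target. The scores stand in for the outputs of any classifier, such as SVM decision values.

```python
import math


def ideal_binary_mask(target_energy, noise_energy, lc_db=0.0):
    """Label each time-frequency unit 1 if its local SNR (dB) exceeds lc_db.

    Inputs are 2-D lists (channels x frames) of premixed target and noise
    energies. lc_db is an assumed name for the local SNR criterion.
    """
    eps = 1e-12  # guards against division by zero and log(0)
    return [
        [1 if 10.0 * math.log10((t + eps) / (n + eps)) > lc_db else 0
         for t, n in zip(t_row, n_row)]
        for t_row, n_row in zip(target_energy, noise_energy)
    ]


def hit_minus_fa(ibm, estimated):
    """Hit rate minus false-alarm rate of an estimated mask against the IBM."""
    pairs = [(i, e)
             for i_row, e_row in zip(ibm, estimated)
             for i, e in zip(i_row, e_row)]
    on_target = [e for i, e in pairs if i == 1]        # target-dominant units
    on_interference = [e for i, e in pairs if i == 0]  # interference-dominant
    hit = sum(on_target) / len(on_target) if on_target else 0.0
    fa = sum(on_interference) / len(on_interference) if on_interference else 0.0
    return hit - fa


def apply_threshold(scores, threshold):
    """Turn real-valued classifier scores into a binary mask."""
    return [[1 if s > threshold else 0 for s in row] for row in scores]


def rethreshold(scores, ibm, candidates):
    """Pick the candidate threshold that maximizes hit minus false alarm.

    A simple grid search on labeled data; the paper's actual procedure
    may differ.
    """
    return max(candidates,
               key=lambda th: hit_minus_fa(ibm, apply_threshold(scores, th)))


if __name__ == "__main__":
    # Toy 2 x 2 example with made-up energies and classifier scores.
    ibm = ideal_binary_mask([[4.0, 1.0], [1.0, 9.0]], [[1.0, 4.0], [1.0, 1.0]])
    scores = [[0.9, -0.8], [-0.6, -0.2]]
    best = rethreshold(scores, ibm, [-1.0, -0.4, 0.0, 0.5])
    print(ibm, best, hit_minus_fa(ibm, apply_threshold(scores, best)))
```

In this toy case the default threshold of 0 misses one target-dominant unit, while a lower threshold recovers it without adding false alarms, which is the kind of bias a re-thresholding step is meant to correct.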

Journal: The Journal of the Acoustical Society of America
Publication Date: Nov 1, 2012
Citations: 81
