Using ideal binary masking based on signal-to-noise ratio of temporal amplitude envelope to improve the intelligibility of speech in noise

Rahim Soleymanpour, Anthony J Brammer, Hillary Marquis, Erin Heiney, Kia Golzari, Insoo Kim

doi:10.1121/10.0008276

Abstract

Ideal binary masking (IBM), using prior information, improves speech intelligibility by attenuating noise-dominant components with a scaling factor. The main challenge is to construct an appropriate decision model that identifies whether a component is noise- or speech-dominant. In this study, the decision was based on the signal-to-noise ratio (SNR) of the temporal amplitude envelope in the time-frequency domain. Using MATLAB, we first divided the noisy speech from 200 Hz to 6 kHz into 16 contiguous subbands, each with a bandwidth of approximately 1.5 times the equivalent rectangular bandwidth (ERB). The subband envelopes were obtained by taking the absolute value of the signal. SNRs of the temporal envelope were calculated over 40 ms windows. The mask was unity when the SNR was greater than −5 dB; otherwise, it was 0.5. We evaluated the proposed IBM using word scores obtained for speech in speech-spectrum-shaped noise at SNRs of −2, −4, −6, and −8 dB. Sixteen native speakers (age 28 ± 3 years) with normal hearing were recruited and completed the Modified Rhyme Test to assess intelligibility. The IBM produced statistically significant increases of up to 20% in mean word scores. [Work supported by NIOSH.]
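
The masking rule described in the abstract can be sketched in a few lines. The following is a minimal illustration in Python (the study itself used MATLAB), not the authors' implementation. Several details are assumptions, because the abstract does not state them: the band edges are spaced equally on the Glasberg and Moore ERB-number scale, the envelope SNR in a window is taken as the ratio of summed squared envelopes, the windows do not overlap, and the subband signals are assumed to have been filtered already. All function names are hypothetical. The threshold (−5 dB), the floor gain (0.5), the 40 ms window, the 200 Hz to 6 kHz range, and the 16 subbands come from the abstract.

```python
import math

MASK_THRESHOLD_DB = -5.0  # from the abstract
MASK_FLOOR = 0.5          # gain applied to noise-dominant windows (abstract)
WINDOW_S = 0.040          # 40 ms analysis window (abstract)


def hz_to_erb_number(f_hz):
    """ERB-number scale of Glasberg and Moore (assumed; not stated in the abstract)."""
    return 21.4 * math.log10(0.00437 * f_hz + 1.0)


def erb_number_to_hz(erb):
    """Inverse of hz_to_erb_number."""
    return (10.0 ** (erb / 21.4) - 1.0) / 0.00437


def band_edges(f_lo=200.0, f_hi=6000.0, n_bands=16):
    """Edges (Hz) of contiguous bands of equal width on the ERB-number scale."""
    lo, hi = hz_to_erb_number(f_lo), hz_to_erb_number(f_hi)
    step = (hi - lo) / n_bands
    return [erb_number_to_hz(lo + i * step) for i in range(n_bands + 1)]


def envelope_snr_db(speech_win, noise_win, eps=1e-12):
    """Envelope SNR for one window of one subband.

    The envelope is the absolute value of the signal (abstract). Taking the
    ratio of summed squared envelopes is an assumption of this sketch.
    """
    s = sum(abs(x) ** 2 for x in speech_win)
    n = sum(abs(x) ** 2 for x in noise_win)
    return 10.0 * math.log10((s + eps) / (n + eps))


def mask_for_band(speech_band, noise_band, fs):
    """Per-window gains: 1.0 above the SNR threshold, MASK_FLOOR otherwise.

    'Ideal' here means the clean speech and the noise are known separately.
    """
    win = max(1, int(round(WINDOW_S * fs)))
    gains = []
    for start in range(0, len(speech_band), win):
        snr = envelope_snr_db(speech_band[start:start + win],
                              noise_band[start:start + win])
        gains.append(1.0 if snr > MASK_THRESHOLD_DB else MASK_FLOOR)
    return gains


def apply_mask(noisy_band, gains, fs):
    """Scale each sample of the noisy subband by the gain of its window."""
    win = max(1, int(round(WINDOW_S * fs)))
    return [x * gains[i // win] for i, x in enumerate(noisy_band)]


# Toy example: a 500 Hz tone in the first 40 ms window, silence in the second.
fs = 16000
speech = [math.sin(2 * math.pi * 500 * t / fs) for t in range(640)] + [0.0] * 640
noise = [0.1] * 1280
gains = mask_for_band(speech, noise, fs)
print(gains)  # [1.0, 0.5]
```

Under the assumed ERB-number scale, the range from 200 Hz to 6 kHz spans about 24.9 ERB, so 16 equal bands are each roughly 1.55 ERB wide, which is consistent with the "approximately 1.5 times" figure in the abstract.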

More From: The Journal of the Acoustical Society of America

Similar Papers

Speech intelligibility in background noise with ideal binary time-frequency masking
Deliang Wang ... Michael S Pedersen
The Journal of the Acoustical Society of America | VOL. 125
01 Apr 2009

Improving intelligibility of dysarthric speech in noise for listeners with hearing loss
Sarah Yoho Leopold ... Stephanie Borrie
The Journal of the Acoustical Society of America | VOL. 154
01 Oct 2023

The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression
R Niederjohn ... J Grotelueschen
IEEE Transactions on Acoustics, Speech, and Signal Processing | VOL. 24
01 Aug 1976

Simulation of the effect of threshold elevation and loudness recruitment combined with reduced frequency selectivity on the intelligibility of speech in noise.
Yoshito Nejime ... Brian C J Moore
The Journal of the Acoustical Society of America | VOL. 102
01 Jul 1997
