A real-time wavelet-based algorithm for improving speech intelligibility

Yijia Chen,Eugene Chau,Keegan Y Sim,Kevin Chau,Jiakun Zheng,Yuxuan Wan

doi:10.1121/1.5147551

Abstract

A wavelet-based algorithm to improve speech intelligibility is reported. The speech signal is split into frequency sub-bands via a multi-level discrete wavelet transform. Various gains are applied to the sub-band signals before they are recombined to form a modified version of the speech. Dynamic range compression then follows to control the peak amplitude. The sub-band gains are adjusted while keeping the overall signal energy unchanged, and the speech intelligibility under simulated hearing loss conditions and various background interference is enhanced and evaluated objectively and quantitatively using Google Speech-to-Text transcription. For English and Chinese noise-free speech, overall intelligibility is improved, and the transcription accuracy can increase by over 80 percentage points by reallocating the spectral energy toward the mid-frequency sub-bands, effectively increasing the consonant-vowel intensity ratio. This is reasonable since the consonants are relatively weak and of short duration, and are therefore the most likely to become indistinguishable in the presence of background noise or high frequency hearing impairment. For speech already corrupted by noise, improving intelligibility is challenging but still realizable. The proposed algorithm is implementable in real-time and comparatively simpler than previous algorithms. Potential applications include speech transmission, hearing aids, machine listening, and a better understanding of speech intelligibility.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A real-time wavelet-based algorithm for improving speech intelligibility

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Multichannel Compression Hearing Aids: Perceptual Considerations
Amyn M Amlani
The ASHA Leader | VOL. 13
Amyn M AmlaniAmyn M Amlani
01 Mar 2008
The ASHA Leader | VOL. 13

Noise-management algorithm may improve speech intelligibility in noise
Francis K Kuk ... Carsten Paludan-Müller
The Hearing Journal | VOL. 59
Francis K Kuk, et. al.Francis K Kuk ... Carsten Paludan-Müller
01 Apr 2006
The Hearing Journal | VOL. 59

Effect of Digital Noise Reduction in Hearing Aids on Speech Intelligibility in Both Quiet and Noisy Environments.
Burcu Deniz ... Eyyup Kara
Noise & health | VOL. 26
Burcu Deniz, et. al.Burcu Deniz ... Eyyup Kara
01 Apr 2024
Noise & health | VOL. 26

The effect of hearing aid dynamic range compression on speech intelligibility in a realistic virtual sound environment.
Naim Mansour ... Marton Marschall
The Journal of the Acoustical Society of America | VOL. 151
Naim Mansour, et. al.Naim Mansour ... Marton Marschall
01 Jan 2021
The Journal of the Acoustical Society of America | VOL. 151

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A real-time wavelet-based algorithm for improving speech intelligibility

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America