A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning.

Tassadaq Hussain,Ahsan Adeel,Muhammad Diyan,Yu Tsao,Amir Hussain,Kia Dashtipour,Mandar Gogate

doi:10.1109/embc48229.2022.9871113

Abstract

Current deep learning (DL) based approaches to speech intelligibility enhancement in noisy environments are often trained to minimise the feature distance between noise-free speech and enhanced speech signals. Despite improving the speech quality, such approaches do not deliver required levels of speech intelligibility in everyday noisy environments. Intelligibility-oriented (I-O) loss functions have recently been developed to train DL approaches for robust speech enhancement. Here, we formulate, for the first time, a novel canonical correlation based I-O loss function to more effectively train DL algorithms. Specifically, we present a canonical-correlation based short-time objective intelligibility (CC-STOI) cost function to train a fully convolutional neural network (FCN) model. We carry out comparative simulation experiments to show that our CC-STOI based speech enhancement framework outperforms state-of-the-art DL models trained with conventional distance-based and STOI-based loss functions, using objective and subjective evaluation measures for case of both unseen speakers and noises. Ongoing future work is evaluating the proposed approach for design of robust hearing-assistive technology.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning.

Abstract

Talk to us

Similar Papers

More From: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference

Lead the way for us

Journal: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference	Publication Date: Jul 11, 2022
Citations: 5

Similar Papers

Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks.
Tao-Wei Wang ... Hsin-Min Wang
IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society | VOL. 29
Tao-Wei Wang, et. al.Tao-Wei Wang ... Hsin-Min Wang
01 Jan 2020
IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society | VOL. 29

Noise-management algorithm may improve speech intelligibility in noise
Carsten Paludan-Müller ... Francis K Kuk
The Hearing Journal | VOL. 59
Carsten Paludan-Müller, et. al.Carsten Paludan-Müller ... Francis K Kuk
01 Apr 2006
The Hearing Journal | VOL. 59

A Study of Joint Effect on Denoising Techniques and Visual Cues to Improve Speech Intelligibility in Cochlear Implant Simulation
Chia-Ying Lee ... Yu Tsao
IEEE Transactions on Cognitive and Developmental Systems | VOL. 13
Chia-Ying Lee, et. al.Chia-Ying Lee ... Yu Tsao
17 Aug 2020
IEEE Transactions on Cognitive and Developmental Systems | VOL. 13

Analysis of clinical features of large-cell neuroendocrine carcinoma patients guided by chest CT image under deep learning
Haiyun Zhou ... Juan Li
The Journal of Supercomputing | VOL. 77
Haiyun Zhou, et. al.Haiyun Zhou ... Juan Li
05 Feb 2021
The Journal of Supercomputing | VOL. 77

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning.

Abstract

Talk to us

Similar Papers

More From: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference