A study on attention-based objective function in deep denoising autoencoder based speech enhancement

Yi-Ying Kao,Kuo-Hsuan Hung,Shih-Kuang Lee,Ying-Hui Lai,Hsiang-Ping Hsu,Chen-Yu Chiang,Yu Tsao

doi:10.1121/1.5136680

Abstract

Speech is one of the most direct and convenient human\machine interfaces. In real-world scenarios, however, various interferences and noises may deteriorate the speech signals and thus reduce speech quality and intelligibility. Therefore, speech enhancement (SE) is an essential component in speech-communication systems. Recently, numerous deep-learning-based SE approaches have been proposed and yield satisfactory performance. In a deep-learning-based SE system, defining a proper objective function plays a crucial role to its success. Generally, the mean square error (MSE) of the predicted and desired outputs are used to form the objective function to learn the parameters in deep-learning models. Because a sequence of speech signals contains various patterns, such as consonant, vowel, beginning and ending silences, and short pauses, it is not optimal to simply use MSE as the objective function, since the contributions of these different patterns may be averaged out. Instead, we should apply specific weights for distinct patterns when designing the objective function. In this presentation, we present a novel objective function, which is used in deep denoising autoencoder-based SE system. The proposed objective function is derived by MSE with multiplying a ratio calculated from clean and noisy speech. The result is evaluated using standardized evaluation metrics, and experiment results confirm the proposed objective function is beneficial to improve the intelligibility of enhanced speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A study on attention-based objective function in deep denoising autoencoder based speech enhancement

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Dynamic acoustic compensation and adaptive focal training for personalized speech enhancement
Xiaofeng Ge ... Yanhua Long
Applied Acoustics | VOL. 216
Xiaofeng Ge, et. al.Xiaofeng Ge ... Yanhua Long
01 Jan 2024
Applied Acoustics | VOL. 216

Noise Adaptive Speech Enhancement Using Domain Adversarial Training
Chien-Feng Liao ... Hung-Yi Lee
-
Chien-Feng Liao, et. al.Chien-Feng Liao ... Hung-Yi Lee
15 Sep 2019
15 Sep 2019

U-Shaped Low-Complexity Type-2 Fuzzy LSTM Neural Network for Speech Enhancement
Nasir Saleem ... Irshad Hussain
IEEE Access | VOL. 11
Nasir Saleem, et. al.Nasir Saleem ... Irshad Hussain
01 Jan 2023
IEEE Access | VOL. 11

Statistical-model-based speech enhancement systems
Y Ephraim
Proceedings of the IEEE | VOL. 80
Y EphraimY Ephraim
01 Jan 1992
Proceedings of the IEEE | VOL. 80

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A study on attention-based objective function in deep denoising autoencoder based speech enhancement

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America