Neural-Free Attention for Monaural Speech Enhancement Toward Voice User Interface for Consumer Electronics

Moran Chen,Xinyuan Qian,Ruijin Guo,Deying Chen,Qiquan Zhang,Mingjiang Wang,Qi Song

doi:10.1109/tce.2023.3254507

Abstract

The traditional graphic user interface in healthcare-oriented consumer electronics faced challenges such as high operational complexity, time-consuming operations, and a high risk of infection. The adoption of voice user interface (VUI) could promote network automation with enhanced efficiency, reduced simplicity and operating expense in various applications. Given noisy operational environments, speech enhancement acts as an indispensable component for VUIs towards consumer devices. Recently, attention mechanism is studied for speech enhancement and exhibits promising potential. In this paper, we propose a novel and effective attention module for speech enhancement, called neural-free attention (NFA), which is a lightweight and plug-and-play module that enables the backbone network to capture the energy distribution information of speech signals along frequency-wise channels. Particularly, NFA adopts a learnable Gaussian function to perform the excitation operation and produce the attention weights for each frequency channel. The NFA is comprehensively evaluated as part of the residual temporal convolution network (ResTCN) backbone network on two commonly used training targets. Experimental results show NFA substantially improves the ResTCN backbone in speech quality and intelligibility, with extremely low parameter overhead. Also, the ResTCN+NFA shows superiority over several recent baseline models, indicating the strong potential for VUIs toward consumer devices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neural-Free Attention for Monaural Speech Enhancement Toward Voice User Interface for Consumer Electronics

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Consumer Electronics

Lead the way for us

Journal: IEEE Transactions on Consumer Electronics	Publication Date: Nov 1, 2023
Citations: 4

Similar Papers

Kalman Filtering with Machine Learning Methods for Speech Enhancement

-

04 May 2021
04 May 2021

Noise-management algorithm may improve speech intelligibility in noise
Francis K Kuk ... Carsten Paludan-Müller
The Hearing Journal | VOL. 59
Francis K Kuk, et. al.Francis K Kuk ... Carsten Paludan-Müller
01 Apr 2006
The Hearing Journal | VOL. 59

Speech enhancement using robust estimators and rank-order statistics
Yuma Sandoval ... Victor H Diaz-Ramirez
COMPEL - The international journal for computation and mathematics in electrical and electronic engineering | VOL. 35
Yuma Sandoval, et. al.Yuma Sandoval ... Victor H Diaz-Ramirez
03 May 2016
COMPEL - The international journal for computation and mathematics in electrical and electronic engineering | VOL. 35

Speech Enhancement
Philipos C Loizou
-
Philipos C LoizouPhilipos C Loizou
25 Feb 2013
25 Feb 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neural-Free Attention for Monaural Speech Enhancement Toward Voice User Interface for Consumer Electronics

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Consumer Electronics