FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Jun Chen, Helen Meng, Zhiyong Wu, Shiyin Kang, Deyi Tuo, Zilin Wang

doi:10.1109/icassp43922.2022.9747888

Abstract

The previously proposed FullSubNet achieved outstanding performance in the Deep Noise Suppression (DNS) Challenge and attracted much attention. However, it still suffers from issues such as input-output mismatch and coarse processing of frequency bands. In this paper, we propose an extended single-channel real-time speech enhancement framework called FullSubNet+ with the following significant improvements. First, we design a lightweight multi-scale time sensitive channel attention (MulCA) module, which adopts multi-scale convolution and a channel attention mechanism to help the network focus on the more discriminative frequency bands for noise reduction. Second, to make full use of the phase information in noisy speech, our model takes the magnitude, real, and imaginary spectrograms together as inputs. Third, by replacing the long short-term memory (LSTM) layers in the original full-band model with stacked temporal convolutional network (TCN) blocks, we design a more efficient full-band module called the full-band extractor. Experimental results on the DNS Challenge dataset show the superior performance of FullSubNet+, which reaches state-of-the-art (SOTA) performance and outperforms other existing speech enhancement approaches.
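
To make the three-input idea concrete, the following is a minimal NumPy sketch of how magnitude, real, and imaginary spectrograms can be derived from one noisy waveform. It is an illustration only and not the authors' implementation: the function names `stft` and `three_spectrograms`, the Hann window, the 512-sample frame, the 256-sample hop, and the 16 kHz random test signal are all assumptions that do not appear in the abstract.

```python
import numpy as np


def stft(signal, n_fft=512, hop=256):
    """Naive STFT with a Hann window (assumed settings, not from the paper).

    Returns a complex array of shape (n_fft // 2 + 1, n_frames).
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    # Slice the waveform into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    # One-sided FFT per frame, transposed to (frequency, time).
    return np.fft.rfft(frames, axis=1).T


def three_spectrograms(signal, n_fft=512, hop=256):
    """Return magnitude, real, and imaginary spectrograms of `signal`."""
    spec = stft(signal, n_fft, hop)
    return np.abs(spec), spec.real, spec.imag


# Hypothetical input: one second of white noise at an assumed 16 kHz rate.
rng = np.random.default_rng(0)
noisy = rng.standard_normal(16000)
mag, real, imag = three_spectrograms(noisy)

# The real and imaginary parts jointly carry the phase that the
# magnitude alone discards.
phase = np.arctan2(imag, real)
```

The magnitude spectrogram on its own loses phase, whereas the real and imaginary parts together determine it, which is the motivation the abstract gives for feeding all three to the network.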

Similar Papers

Monaural Speech Enhancement Using a Multi-Branch Temporal Convolutional Network
Qiquan Zhang ... Aaron Nicolson
SSRN Electronic Journal
01 Jan 2021

Short-term prediction of the significant wave height and average wave period based on the variational mode decomposition–temporal convolutional network–long short-term memory (VMD–TCN–LSTM) algorithm
Qiyan Ji ... Yuting Zhang
Ocean Science | VOL. 19
09 Nov 2023

Speech Enhancement Using Convolutional Recurrent Neural Network with Twin Gate Units and Two-Stage Modeling
Baosheng Lv ... Yongbao Ma
09 Dec 2022

A Hybrid Prediction Method for Realistic Network Traffic With Temporal Convolutional Network and LSTM
Jing Bi ... Haitao Yuan
IEEE Transactions on Automation Science and Engineering | VOL. 19
22 May 2021
