Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses

Shengkui Zhao,Trung Hieu Nguyen,Bin Ma

doi:10.1109/icassp39728.2021.9414569

Abstract

Deep complex U-Net structure and convolutional recurrent network (CRN) structure achieve state-of-the-art performance for monaural speech enhancement. Both deep complex U-Net and CRN are encoder and decoder structures with skip connections, which heavily rely on the representation power of the complex-valued convolutional layers. In this paper, we propose a complex convolutional block attention module (CCBAM) to boost the representation power of the complex-valued convolutional layers by constructing more informative features. The CCBAM is a lightweight and general module which can be easily integrated into any complex-valued convolutional layers. We integrate CCBAM with the deep complex U-Net and CRN to enhance their performance for speech enhancement. We further propose a mixed loss function to jointly optimize the complex models in both time-frequency (TF) domain and time domain. By integrating CCBAM and the mixed loss, we form a new end-to-end (E2E) complex speech enhancement framework. Ablation experiments and objective evaluations show the superior performance of the proposed approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Monaural Speech Enhancement using Deep Neural Network with Cross-Speech Dataset
Norezmi Jamal ... Shahnoor Shanta
-
Norezmi Jamal, et. al.Norezmi Jamal ... Shahnoor Shanta
13 Sep 2021
13 Sep 2021

Deep beamforming for speech enhancement and speaker localization with an array response-aware loss function
Hsinyu Chang ... Yicheng Hsu
Frontiers in Signal Processing | VOL. 4
Hsinyu Chang, et. al.Hsinyu Chang ... Yicheng Hsu
10 Sep 2024
Frontiers in Signal Processing | VOL. 4

End-to-End Speech Enhancement Using Fully Convolutional Networks with Skip Connections
Dujuan Wang ... Changchun Bao
-
Dujuan Wang, et. al.Dujuan Wang ... Changchun Bao
01 Nov 2019
01 Nov 2019

Speech Enhancement via Mask-Mapping Based Residual Dense Network
Lin Zhou ... Qiuyue Zhong
Computers, Materials & Continua | VOL. 74
Lin Zhou, et. al.Lin Zhou ... Qiuyue Zhong
01 Jan 2023
Computers, Materials & Continua | VOL. 74

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses

Abstract

Talk to us

Similar Papers