Abstract

The U-shaped Network (UNet) has shown excellent performance in a variety of speech enhancement tasks. However, because of the intrinsic limitations of the convolution operation, a traditional UNet built with convolutional neural networks (CNNs) cannot learn global and long-term information well. In this work, we propose a new Swin-UNet-based speech enhancement method. Unlike the traditional UNet model, ours replaces all CNN blocks with Swin-Transformer blocks to exploit more multi-scale contextual information. The Swin-UNet model employs a shifted-window mechanism, which not only avoids the high computational complexity of the standard Transformer but also enhances global information interaction by exploiting the Transformer's strong global modeling capability. Through hierarchical Swin-Transformer blocks, global and local speech features can be fully leveraged to improve speech reconstruction. Experimental results confirm that the proposed method removes more background noise while maintaining good objective speech quality.
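To make the shifted-window idea concrete, the sketch below shows windowed self-attention over a spectrogram-like tensor, with a cyclic shift between consecutive blocks so that information can cross window boundaries. This is a minimal illustration, not the authors' implementation: the window size, embedding dimension, head count, and input shape are assumptions, and the relative position bias and shifted-window attention mask of the full Swin-Transformer block are omitted for brevity.

```python
# Minimal sketch of shifted-window self-attention (Swin-style), illustrative only.
import torch
import torch.nn as nn

def window_partition(x, ws):
    # (B, H, W, C) -> (num_windows * B, ws * ws, C)
    B, H, W, C = x.shape
    x = x.view(B, H // ws, ws, W // ws, ws, C)
    return x.permute(0, 1, 3, 2, 4, 5).reshape(-1, ws * ws, C)

def window_reverse(windows, ws, H, W):
    # Inverse of window_partition.
    B = windows.shape[0] // ((H // ws) * (W // ws))
    x = windows.view(B, H // ws, W // ws, ws, ws, -1)
    return x.permute(0, 1, 3, 2, 4, 5).reshape(B, H, W, -1)

class ShiftedWindowAttention(nn.Module):
    # Hyperparameters (dim, heads, ws) are illustrative assumptions.
    def __init__(self, dim=96, heads=3, ws=8, shift=0):
        super().__init__()
        self.ws, self.shift = ws, shift
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):                       # x: (B, H, W, C)
        B, H, W, C = x.shape
        shortcut = x
        x = self.norm(x)
        if self.shift:                          # cyclic shift lets windows exchange information
            x = torch.roll(x, (-self.shift, -self.shift), dims=(1, 2))
        win = window_partition(x, self.ws)      # attention restricted to local windows
        win, _ = self.attn(win, win, win)
        x = window_reverse(win, self.ws, H, W)
        if self.shift:                          # undo the shift
            x = torch.roll(x, (self.shift, self.shift), dims=(1, 2))
        return shortcut + x                     # residual connection

# Two consecutive blocks: regular windows, then shifted windows.
spec = torch.randn(1, 64, 64, 96)               # (batch, time, freq, channels), assumed shape
out = ShiftedWindowAttention(shift=4)(ShiftedWindowAttention(shift=0)(spec))
print(out.shape)                                 # torch.Size([1, 64, 64, 96])
```

Because attention is computed only within fixed-size windows, the cost grows linearly with the number of time-frequency bins rather than quadratically, which is the complexity advantage the abstract refers to.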
