Self-Attention Generative Adversarial Network for Speech Enhancement

Huy Phan,Ian Mcloughlin,Huy Le Nguyen,Ngoc Q K Duong,Oliver Y Chen,Philipp Koch,Alfred Mertins

doi:10.1109/icassp39728.2021.9414265

Abstract

Existing generative adversarial networks (GANs) for speech enhancement solely rely on the convolution operation, which may obscure temporal dependencies across the sequence input. To remedy this issue, we propose a self-attention layer adapted from non-local attention, coupled with the convolutional and deconvolutional layers of a speech enhancement GAN (SEGAN) using raw signal input. Further, we empirically study the effect of placing the self-attention layer at the (de)convolutional layers with varying layer indices as well as at all of them when memory allows. Our experiments show that introducing self-attention to SEGAN leads to consistent improvement across the objective evaluation metrics of enhancement performance. Furthermore, applying at different (de)convolutional layers does not significantly alter performance, suggesting that it can be conveniently applied at the highest-level (de)convolutional layer with the smallest memory overhead <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Self-Attention Generative Adversarial Network for Speech Enhancement

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Exploring Multi-Stage GAN with Self-Attention for Speech Enhancement
Bismark Kweku Asiedu Asante ... Hiroki Imamura
Applied Sciences | VOL. 13
Bismark Kweku Asiedu Asante, et. al.Bismark Kweku Asiedu Asante ... Hiroki Imamura
14 Aug 2023
Applied Sciences | VOL. 13

Super-Resolution Generative Adversarial Network with Modified Architecture for Single Image Super-Resolution
N Amruth Gowtham ... Dipti Patra
-
N Amruth Gowtham, et. al.N Amruth Gowtham ... Dipti Patra
28 Sep 2020
28 Sep 2020

Light-Weight Self-Attention Augmented Generative Adversarial Networks for Speech Enhancement
Lujun Li ... Zhenxing Lu
Electronics | VOL. 10
Lujun Li, et. al.Lujun Li ... Zhenxing Lu
30 Jun 2021
Electronics | VOL. 10

ACG-Engine: An Inference Accelerator for Content Generative Neural Networks
Haobo Xu ... Bosheng Liu
-
Haobo Xu, et. al.Haobo Xu ... Bosheng Liu
01 Nov 2019
01 Nov 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-Attention Generative Adversarial Network for Speech Enhancement

Abstract

Talk to us

Similar Papers