Spectrogram-based speech enhancement by spatial attention generative adversarial networks

Haixin Luo,Qian Wei,Jindong Tian,Shengyu Lu,Yu Fu

doi:10.1117/12.2644385

Abstract

The spectrogram can clearly show the composition of different frequencies in the speech signal. In this paper, a speech enhancement method based on deep learning image processing is proposed, which optimizes the spectrogram of the laser detected speech signal to achieve speech enhancement. The laser beam emitted by the laser Doppler vibrometer (LDV) is focused on the glass window to detect the vibration caused by sound wave. After conversion, the audio information that causes vibration is obtained. Under the interference of speckle noise and air disturbance, the detected speech signal not only has a low signal-to-noise ratio (SNR) but also has non-stationary noise. In order to overcome the difficulty that traditional methods are difficult to extract weak signals in the case of severe noise interference, we use deep learning to achieve spectrogram noise reduction and speech information enhancement. By processing the spectrogram of noisy speech with the generative adversarial networks (GAN) combined with the spatial attention mechanism and introducing the short-time objective intelligibility (STOI) into the loss function, the laser detected speech signal was successfully enhanced.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spectrogram-based speech enhancement by spatial attention generative adversarial networks

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Performance analysis of neural network, NMF and statistical approaches for speech enhancement
Ravi Kumar Kandagatla ... Venkata Subbaiah Potluri
International Journal of Speech Technology | VOL. 23
Ravi Kumar Kandagatla, et. al.Ravi Kumar Kandagatla ... Venkata Subbaiah Potluri
17 Sep 2020
International Journal of Speech Technology | VOL. 23

Speech Enhancement based on Deep Convolutional Neural Network
Ramesh Nuthakki ... Yukta T N
-
Ramesh Nuthakki, et. al.Ramesh Nuthakki ... Yukta T N
11 Nov 2021
11 Nov 2021

Perceptual Loss Function for Speech Enhancement Based on Generative Adversarial Learning
Xin Bai ... Haifeng Huang
-
Xin Bai, et. al.Xin Bai ... Haifeng Huang
07 Nov 2022
07 Nov 2022

New research on monaural speech segregation based on quality assessment
Xiaoping Xie ... Fei Ding
Computer Speech & Language | VOL. 85
Xiaoping Xie, et. al.Xiaoping Xie ... Fei Ding
05 Dec 2023
Computer Speech & Language | VOL. 85

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spectrogram-based speech enhancement by spatial attention generative adversarial networks

Abstract

Talk to us

Similar Papers