GSDNet: Gated Self-Supervised Denoising Speech Control Network

Caitong Bai, Ai Li, Xiaolong Cui

doi:10.1088/1742-6596/2033/1/012157

Abstract

Labeling data remains costly, even though deep neural networks have been applied effectively to speech recognition. At the same time, noise still degrades the performance of speech-recognition methods, so making full use of available data sets to improve the robustness of recognition systems remains challenging. In this letter, we construct GSDNet, a gated self-supervised denoising speech control network consisting of three parts: a denoising feature-extraction frontend, a speech-recognition encoder, and a decoder based on gated convolutional neural networks with self-supervised regression. GSDNet provides a low-cost method for training a robust speech-recognition system, and we apply it to equipment-control tasks. Experimental results on the THCH30 and AISHELL data sets for equipment control show that the word error rate is less than 0.2 without a language model.
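
For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a word error rate is conventionally computed: the token-level edit distance between a reference transcript and a recognizer's hypothesis, divided by the reference length. It is not taken from the paper. The function name `word_error_rate` and the example command strings are hypothetical, and the abstract does not state how the authors tokenized their transcripts (words, characters, or syllables).

```python
def word_error_rate(reference, hypothesis):
    """Return (substitutions + deletions + insertions) / len(reference).

    Both arguments are lists of tokens. This is the conventional WER
    definition, not code from the GSDNet paper.
    """
    if not reference:
        raise ValueError("reference must contain at least one token")

    # prev[j] is the edit distance between the reference prefix
    # processed so far and hypothesis[:j].
    prev = list(range(len(hypothesis) + 1))
    for i, ref_tok in enumerate(reference, start=1):
        curr = [i]  # distance from reference[:i] to an empty hypothesis
        for j, hyp_tok in enumerate(hypothesis, start=1):
            cost = 0 if ref_tok == hyp_tok else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference token missing
                curr[j - 1] + 1,     # insertion: extra hypothesis token
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(reference)


# Hypothetical equipment-control command, for illustration only.
ref = "turn on the main power supply".split()
hyp = "turn on the power supply".split()
wer = word_error_rate(ref, hyp)
print(f"WER = {wer:.3f}")
```

In this illustration the hypothesis drops one of six reference words, so the rate is 1/6, or about 0.167. A reported rate below 0.2 therefore corresponds to fewer than one error per five reference tokens on average.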

Journal: Journal of Physics: Conference Series
Publication Date: Sep 1, 2021
License type: CC BY

Similar Papers

Binary neural networks for speech recognition
Yan-Min Qian ... Xu Xiang
Frontiers of Information Technology & Electronic Engineering | VOL. 20
01 May 2019

Optimization of Deep Neural Network for Automatic Speech Recognition
Aqbal Waris ... R.K Aggarwal
01 Jul 2018

A Language Model Optimization Method for Turkish Automatic Speech Recognition System
Saadin Oyucu ... Hüseyin Polat
Politeknik Dergisi | VOL. 26
01 Oct 2023

Unfolded recurrent neural networks for speech recognition
George Saon ... Ahmad Emami
14 Sep 2014
