An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting

Yue Gu,Xueliang Zhang,Hui Zhang,Zhihao Du

doi:10.1007/978-3-030-63830-6_2

Abstract

AbstractIn real-world applications, robustness against noise is crucial for small-footprint keyword spotting (KWS) systems which are deployed on resource-limited devices. To improve the noise robustness, a reasonable approach is employing a speech enhancement model to enhance the noisy speeches first. However, current enhancement models need a lot of parameters and computation, which do not satisfy the small-footprint requirement. In this paper, we design a lightweight enhancement model, which consists of the convolutional layers for feature extracting, recurrent layers for temporal modeling and deconvolutional layers for feature recovering. To reduce the mismatch between the enhanced features and KWS system desired ones, we further propose an efficient joint training framework, in which the enhancement model and KWS system are concatenated and jointly fine-tuned through a trainable feature transformation block. With the joint training, linguistic information can back-propagate from the KWS system to the enhancement model and guide its training. Our experimental results show that the proposed small-footprint enhancement model significantly improves the noise robustness of KWS systems without much increasing model or computation complexity. Moreover, the recognition performance can be further improved through the proposed joint training framework.KeywordsSmall footprintRobust KWSSpeech enhancement

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Keyword Spotting using Vowel Onset Point, Vector Quantization and Hidden Markov Modeling Based techniques
B V Sandeep Reddy ... S R Mahadeva Prasanna
-
B V Sandeep Reddy, et. al.B V Sandeep Reddy ... S R Mahadeva Prasanna
01 Nov 2008
01 Nov 2008

Developing STT and KWS systems using limited language resources
Viet-Bac Le ... Jean-Luc Gauvain
-
Viet-Bac Le, et. al.Viet-Bac Le ... Jean-Luc Gauvain
14 Sep 2014
14 Sep 2014

Different confidence measures for word verification in speech recognition
M.C Benı́Tez ... A De La Torre
Speech Communication | VOL. 32
M.C Benı́Tez, et. al.M.C Benı́Tez ... A De La Torre
14 Aug 2000
Speech Communication | VOL. 32

An End-to-End Far-Field Keyword Spotting System with Neural Beamforming
Xuan Ji ... Ming Liu
-
Xuan Ji, et. al.Xuan Ji ... Ming Liu
13 Dec 2021
13 Dec 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient Joint Training Framework for Robust Small-Footprint Keyword Spotting

Abstract

Talk to us

Similar Papers