A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting

Ivan Lopez-Espejo,Zheng-Hua Tan,Jesper Jensen

doi:10.1109/taslp.2021.3092567

Abstract

The development of keyword spotting (KWS) systems that are accurate in noisy conditions remains a challenge. Towards this goal, in this paper we propose a novel training strategy relying on multi-condition training for noise-robust KWS. By this strategy, we think of the state-of-the-art KWS models as the composition of a keyword embedding extractor and a linear classifier that are successively trained. To train the keyword embedding extractor, we also propose a new (C <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">N,2</sub> +1)-pair loss function extending the concept behind related loss functions like triplet and N-pair losses to reach larger inter-class and smaller intra-class variation. Experimental results on a noisy version of the Google Speech Commands Dataset show that our proposal achieves around 12% KWS accuracy relative improvement with respect to standard end-to-end multi-condition training when speech is distorted by unseen noises. This performance improvement is achieved without increasing the computational complexity of the KWS model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2021
Citations: 14

Similar Papers

Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting
Eva Sharma ... Guoli Ye
-
Eva Sharma, et. al.Eva Sharma ... Guoli Ye
01 May 2020
01 May 2020

Keyword Spotting using Vowel Onset Point, Vector Quantization and Hidden Markov Modeling Based techniques
B V Sandeep Reddy ... S R Mahadeva Prasanna
-
B V Sandeep Reddy, et. al.B V Sandeep Reddy ... S R Mahadeva Prasanna
01 Nov 2008
01 Nov 2008

An End-to-End Far-Field Keyword Spotting System with Neural Beamforming
Xuan Ji ... Ming Liu
-
Xuan Ji, et. al.Xuan Ji ... Ming Liu
13 Dec 2021
13 Dec 2021

Different confidence measures for word verification in speech recognition
M.C Benı́Tez ... A De La Torre
Speech Communication | VOL. 32
M.C Benı́Tez, et. al.M.C Benı́Tez ... A De La Torre
14 Aug 2000
Speech Communication | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing