Multi-Resolution Stacked 1D-CNN for Small-Footprint keyword Spotting with Two-Stage Detection

Jian Tang,Shaofei Xue

doi:10.1109/iscslp57327.2022.10038235

Abstract

Keyword spotting (KWS) is an important technique to free users’ hands in man-machine communication. It is quite challenging to build a system with both low False Reject Ratio (FRR) and low False Alarm Ratio (FAR) for real scenarios, especially when computational resources are limited. In this paper, we propose a two-stage KWS system to obtain the trade-off between low computation and high performance. To meet the low-computation requirement, we propose an acoustic model based on multi-resolution GLU stacked 1D convolutional neural network (MRG-SID). The second requirement is achieved by a second stage classification strategy, in which the neural network features are selected as classifier input for final wakeup word detection. Without increasing the relative FRR, it can reduce the FAR by introducing a few network parameters only. Experiments on a 10K hours Mandarin dataset show that the proposed model can achieve a 39.8% relative FRR reduction compared to the traditional Stacked 1D-CNN. With the second stage classifier, we are further able to reduce the FAR relatively by about 70%. In total, our proposed system significantly leads to a 62.1% relative FRR reduction at 0.1 false alarm per hour.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Resolution Stacked 1D-CNN for Small-Footprint keyword Spotting with Two-Stage Detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A 5-yr Climatology of Tornado False Alarms
J Brotzge ... S Erickson
Weather and Forecasting | VOL. 26
J Brotzge, et. al.J Brotzge ... S Erickson
01 Aug 2011
Weather and Forecasting | VOL. 26

Cry Wolf Effect? Evaluating the Impact of False Alarms on Public Responses to Tornado Alerts in the Southeastern United States
Brooke Fisher Liu ... Michael Egnoto
Weather, Climate, and Society | VOL. 11
Brooke Fisher Liu, et. al.Brooke Fisher Liu ... Michael Egnoto
11 Jun 2019
Weather, Climate, and Society | VOL. 11

Evaluation of real-time satellite rainfall products in semi-arid/arid Australia
...
-
, et. al. ...
01 Dec 2013
01 Dec 2013

False Alarms, Tornado Warnings, and Tornado Casualties
Kevin M Simmons ... Daniel Sutter
Weather, Climate, and Society | VOL. 1
Kevin M Simmons, et. al.Kevin M Simmons ... Daniel Sutter
01 Oct 2009
Weather, Climate, and Society | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Resolution Stacked 1D-CNN for Small-Footprint keyword Spotting with Two-Stage Detection

Abstract

Talk to us

Similar Papers