Small-Footprint Wake Up Word Recognition in Noisy Environments Employing Competing-Words-Based Feature

Ki-Mu Yoon,Wooil Kim

doi:10.3390/electronics9122202

Ki-Mu Yoon, Wooil Kim

Open Access

PDF Available

https://doi.org/10.3390/electronics9122202

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

This paper proposes a small-footprint wake-up-word (WUW) recognition system for real noisy environments by employing the competing-words-based feature. Competing-words-based features are generated using a ResNet-based deep neural network with small parameters using the competing-words dataset. The competing-words dataset consists of the most acoustically similar and dissimilar words to the WUW used for our system. The obtained features are used as input to the classification network, which is developed using the convolutional neural network (CNN) model. To obtain sufficient data for training, data augmentation is performed by using a room impulse response filter and adding sound signals of various television shows as background noise, which simulates an actual living room environment. The experimental results demonstrate that the proposed WUW recognition system outperforms the baselines that employ CNN and ResNet models. The proposed system shows 1.31% in equal error rate and 1.40% false rejection rate at a 1.0% false alarm rate, which are 29.57% and 50.00% relative improvements compared to the ResNet system, respectively. The number of parameters used for the proposed system is reduced by 83.53% compared to the ResNet system. These results prove that the proposed system with the competing-words-based feature is highly effective at improving WUW recognition performance in noisy environments with a smaller footprint.

Highlights

As speech recognition systems use large amount of resources, to minimize computational load, many systems employ wake-up-word (WUW) recognition so that they can be awakened to an active mode once WUW is recognized
To obtain sufficient data for training, data augmentation is performed by using a room impulse response filter and adding sound signals of various television shows as background noise, which simulates an actual living room environment
We proposed a small-footprint WUW recognition system for noisy environments by employing the competing-words-based feature

Summary

Introduction

As speech recognition systems use large amount of resources, to minimize computational load, many systems employ wake-up-word (WUW) recognition so that they can be awakened to an active mode once WUW is recognized. Many studies on small-footprint keyword spotting have shown effectiveness by employing different types of deep networks, including convolutional neural networks (CNN) [4], convolution. We propose utilizing competing words in order to improve WUW recognition performance and minimize the model size of the system. A high-level feature was generated using the competing-words dataset and the residual network. The competing-words-based feature was used as an input to the CNN-based network for classification network training. For small-footprint systems, we focused on minimizing the size of the model parameters as well as increasing the recognition accuracy.

Proposed WUW Recognition System

Selection of Competing Words

Generation of Competing-Words-Based Feature

Configurations of thegeneration

Classification Network

Analysis of the Competing-Words Network-Based Feature

Distribution contour curves of two-dimensional vector through

Database

Experimental Results

Conclusions

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Dec 21, 2020
Citations: 2	License type: CC BY 4.0

R Discovery Prime

Small-Footprint Wake Up Word Recognition in Noisy Environments Employing Competing-Words-Based Feature

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Tunnel boring machine vibration-based deep learning for the ground identification of working faces
Mengbo Liu ... Yanqing Men
Journal of Rock Mechanics and Geotechnical Engineering | VOL. 13
Mengbo Liu, et. al.Mengbo Liu ... Yanqing Men
01 Dec 2021
Journal of Rock Mechanics and Geotechnical Engineering | VOL. 13

Two-stage Strategy for Small-footprint Wake-up-word Speech Recognition System
Xinya You ... Yajie Zhao
-
Xinya You, et. al.Xinya You ... Yajie Zhao
01 Jul 2020
01 Jul 2020

On-Device System for Device Directed Speech Detection for Improving Human Computer Interaction
Abhishek Singh ... Rituraj Kabra
IEEE Access | VOL. 9
Abhishek Singh, et. al.Abhishek Singh ... Rituraj Kabra
01 Jan 2020
IEEE Access | VOL. 9

Use of deep learning in the MRI diagnosis of Chiari malformation type I
Kaishin W Tanaka ... Sidong Liu
Neuroradiology | VOL. 64
Kaishin W Tanaka, et. al.Kaishin W Tanaka ... Sidong Liu
24 Feb 2022
Neuroradiology | VOL. 64

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Small-Footprint Wake Up Word Recognition in Noisy Environments Employing Competing-Words-Based Feature

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Electronics