Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices

Ivan Lopez-Espejo,Jesper Jensen,Zheng-Hua Tan

doi:10.1109/taslp.2020.2984089

Ivan Lopez-Espejo, Jesper Jensen + Show 1 more

Open Access

https://doi.org/10.1109/taslp.2020.2984089

Copy DOI

Abstract

For certain applications, keyword spotting (KWS) requires some degree of personalization. This is the case for KWS for hearing assistive devices, e.g., hearing aids, where only the device user should be allowed to trigger the KWS system. In this paper, we first develop a new realistic hearing aid experimental framework. Next, using this framework we show that the performance of a state-of-the-art multi-task deep learning architecture exploiting cepstral features for joint KWS and users’ own-voice/external speaker detection drops significantly. To overcome this problem, we use phase difference information through GCC-PHAT (Generalized Cross-Correlation with PHAse Transform)-based coefficients along with log-spectral magnitude features. In addition, we demonstrate that working in the perceptually-motivated constant-Q transform (CQT) domain instead of in the short-time Fourier transform (STFT) domain allows for the generation of compact and coherent features which provide superior KWS performance. Our experimental results show that our CQT-based proposal achieves a relative KWS accuracy improvement of around 18% compared to using cepstral features while dramatically decreasing the number of multiplications in the multi-task architecture, which is key in the context of low-resource devices like hearing assistive devices.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2020
Citations: 44	License type: other-oa

R Discovery Prime

R Discovery Prime

Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Similar Papers

Improved estimation of direction of arrival of sound sources for hearing aids using gyroscopic information
Alan W Boyd ... William M Whitmer
-
Alan W Boyd, et. al.Alan W Boyd ... William M Whitmer
01 Jan 2013
01 Jan 2013

Improved estimation of direction of arrival of sound sources for hearing aids using gyroscopic information
Alan W Boyd ... W Owen Brimijoin
The Journal of the Acoustical Society of America | VOL. 133
Alan W Boyd, et. al.Alan W Boyd ... W Owen Brimijoin
01 May 2013
The Journal of the Acoustical Society of America | VOL. 133

A multi-task learning-based framework for global maritime trajectory and destination prediction with AIS data
Wells Wang ... Zheng Liu
Maritime Transport Research | VOL. 3
Wells Wang, et. al.Wells Wang ... Zheng Liu
01 Jan 2021
Maritime Transport Research | VOL. 3

Music to the Impaired or Implanted Ear
Kate Gfeller ... John F Knutson
The ASHA Leader | VOL. 8
Kate Gfeller, et. al.Kate Gfeller ... John F Knutson
01 Apr 2003
The ASHA Leader | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved External Speaker-Robust Keyword Spotting for Hearing Assistive Devices

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing