Mixed Bandwidth Acoustic Modeling Leveraging Knowledge Distillation

Takashi Fukuda,Samuel Thomas

doi:10.1109/asru46091.2019.9003760

Abstract

Training of mixed bandwidth acoustic models have recently been realized by incorporating special Mel filterbanks. To fit information into every filterbank bin available across both narrowband and wideband data, these filterbanks pad zeros at high frequency ranges of narrowband data. Although these methods succeed in decreasing word error rates (WER) on broadband data, they fail to improve on narrowband signals. In this paper, we propose methods to mitigate these effects with generalized knowledge distillation. In our method, specialized teacher networks are first trained on lossless acoustic features with full scale Mel filterbanks. While training student networks, privileged knowledge from these teacher networks is then used to compensate for missing information at high frequencies introduced by the special Mel filterbanks. We show the benefit of the proposed technique for both narrowband (10% relative WER improvement) and wideband data (7.5% relative WER improvement) on the Aurora 4 task over traditional methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mixed Bandwidth Acoustic Modeling Leveraging Knowledge Distillation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An FPGA-based Direct Sampling and Digital Processing System for Wideband and Narrowband Radar Signal
Yingxiao Zhao ... Hongliang You
Journal of Physics: Conference Series | VOL. 1624
Yingxiao Zhao, et. al.Yingxiao Zhao ... Hongliang You
01 Oct 2020
Journal of Physics: Conference Series | VOL. 1624

Optimizing Expected Word Error Rate via Sampling for Speech Recognition
Matt Shannon
-
Matt ShannonMatt Shannon
20 Aug 2017
20 Aug 2017

Human vs machine spoofing detection on wideband and narrowband data
Mirjam Wester ... Zhizheng Wu
-
Mirjam Wester, et. al.Mirjam Wester ... Zhizheng Wu
06 Sep 2015
06 Sep 2015

Privileged Knowledge Distillation for SAR Building Extraction
Eungbean Lee ... Somi Jeong
-
Eungbean Lee, et. al.Eungbean Lee ... Somi Jeong
11 Jul 2021
11 Jul 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mixed Bandwidth Acoustic Modeling Leveraging Knowledge Distillation

Abstract

Talk to us

Similar Papers