Research on Bandwidth Mismatch Compensation in Speech Recognition

Yong-Jun He,Ji-Qing Han

doi:10.3724/sp.j.1016.2011.01629

Abstract

Speech recognition systems obtaining high recognition rates in clean environments perform badly in mismatch environments without compensation.Based on the research,we found that bandwidth mismatch,namely the bandwidth difference between the training and test conditions,is one of the main factors leading to environment mismatch.When the bandwidth of the test speech is narrower than that of the training speech,the distortion is non-invertible and time-varying in the logarithm spectrum and cepstrum domains.So it could not be compensated with current channel compensation methods.After analyzing the Mel-frequency cepstrum coefficient distortion caused by the lost frequency band,we propose a compensation method based on spectral fold.Furthermore,we provide an algorithm for speech bandwidth detection and a unified compensation framework.Experiments on the AN4 and TIMIT/TIMIT databases show that the proposed framework improved the robustness of speech recognition underbandwidth mismatch conditions.

Full Text