Predictive coding of speech using microphone/speaker adaptation and vector quantization

A.I Aarskog,H.C Guren

doi:10.1109/89.279275

Abstract

A problem with speech coders based on trained quantizers is their lack of robustness against variations in the microphone and input filter response. In this paper a simple backward adaptive prefiltering technique is proposed as a means of improving the robustness and quality of a speech coder at no cost in bit rate. The technique is particularly useful in conjunction with vector quantization (VQ) of the linear predictive coding (LPC) parameters. The performance of the prefilter, denoted a microphone and speaker adaptation (MSA) filter, has been evaluated in terms of prediction gain and spectral distortion, together with objective and subjective quality of a 7.5 kbit/s CELP speech coder. In this coder a 10-bit direct VQ of the LPC parameters using the residual energy distortion measure has been applied. This is consistent with the covariance method of LPC analysis. Simulation results illustrate that the MSA filter significantly improves the performance and robustness of the LPC VQ against changes in the input response. The 7.5 kbit/s CELP with a trained excitation codebook and MSA was found to be clearly better (subjectively and objectively) than the one without MSA. The coder with MSA also showed to be practically indistinguishable from the same CELP with unquantized LPC coefficients and a stochastic excitation codebook. >

Full Text