We present a new technique that reduces wide-band noise in noisy speech. The concept of the approach is based on residual or voice excited linear predictive coding (LPC). The technique can be used in either a digital or an analog communication link that is degraded by quantization or receiver thermal noise, or both. The technique capitalizes on two effects. (1) Linear predictive coefficients representing an all-pole vocal tract transfer function can be calculated unaffected by noise, if the noise present in speech is additive and white. (2) The signal-to-noise ratio of voiced sound in the fundamental frequency region of the voice band is much higher than that in the high-frequency region. The basic procedure for conditioning noisy speech is as follows. When a voice signal that has been distorted because of quantization or thermal noise, or both, is received at a receiver, the decoded signal is analyzed to determine a set of LPC coefficients by the autocorrelation approach. The noise affects only the diagonal elements of the autocorrelation matrix, since it is not correlated with speech. Hence, to find the coefficients that are unaffected by noise, the noise power is subtracted from the diagonal elements of the matrix. The power of white noise is measured from a speech pause. The residual signal or noisy speech is low-pass filtered at a cutoff frequency of 600 Hz and then passed through a nonlinear distortion device to broaden its bandwidth, This nonlinearly processed signal is used as the input to excite the filter which has been formed by the LPC coefficients. Initial testing of the conditioner by computer simulation has yielded promising results.
Read full abstract