Abstract

For efficient coding of speech, it is desirable to separate the slowly and rapidly evolving spectral components to take advantage of their different perceptual qualities. We present a multi-level wavelet decomposition mechanism, using low delay FIR filters, applied to waveform interpolation (WI) coding. The technique overcomes the substantial delay problems encountered by Chong, Burnett, Chicharo and Thomson (see Proc. Int Conf. Acoustics, Speech, Sig. Processing, Seattle, USA, vol. 1., p.513-16, 1998) and identifies a preferred technique for the quantisation of the decomposed surfaces. The phase is shown to be particularly sensitive to the compounding of quantisation errors within the tree-structured transform. The proposed solution involves the use of variable dimension vector quantisation (VDVQ) on separately decomposed magnitude/phase surfaces. This approach provides for coarse or no phase quantisation while maintaining high speech quality. The techniques discussed may also be applied to other transforms and to the quantisation of surfaces in the standard waveform interpolation coder.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.