Abstract

We study experimentally the operational rate-distortion performance of very low bit-rate speech coding that exploits linear inter-frame dependencies. We propose an algorithm that efficiently combines quantization and linear interpolation. For the spectral envelope information, with a maximum delay of 200 ms and line spectrum pair (LSP) parameters as the input space, the proposed algorithm performs best at rates between 200 and 300 b/s. For comparison, several other procedures are simulated, including the multi-frame encoder (Kemp D., Collura J., Tremain T., Multi-frame coding of LPC parameters at 600–800 bps. In: IEEE ICASSP-91, 1991, pp. 609–612) and the matrix quantizer (Tsao C., Gray R., Matrix quantizer design for LPC speech using the generalized Lloyd algorithm. IEEE Transactions on Acoustics, Speech, and Signal Processing ASSP-33, 1985, pp. 537–545). Furthermore, a mono-dimensional version of the proposed procedure is shown experimentally to provide the best operational rate-distortion trade-off when coding a parametric representation (pitch, gain and voicing information) of the excitation signal.
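
As a rough illustration of the general idea of combining quantization with linear interpolation across frames, the following sketch quantizes only a subset of "anchor" frames and reconstructs the frames in between by linear interpolation. It is not the algorithm proposed in the paper, whose details the abstract does not give. The uniform scalar quantizer, the fixed anchor spacing, and all function and parameter names (`quantize`, `encode_decode`, `spacing`, `step`) are assumptions made purely for illustration.

```python
def quantize(vec, step):
    """Hypothetical uniform scalar quantizer: snap each component to a multiple of step."""
    return [step * round(x / step) for x in vec]


def interpolate(a, b, t):
    """Linear interpolation between vectors a and b at position t in [0, 1]."""
    return [(1 - t) * x + t * y for x, y in zip(a, b)]


def encode_decode(frames, spacing, step):
    """Quantize every `spacing`-th frame (plus the last one) and
    rebuild the remaining frames by interpolating between quantized anchors."""
    n = len(frames)
    if n == 0:
        return []
    anchors = list(range(0, n, spacing))
    if anchors[-1] != n - 1:
        anchors.append(n - 1)  # always anchor the final frame

    decoded = [None] * n
    for i in anchors:
        decoded[i] = quantize(frames[i], step)

    # Fill the gaps between consecutive anchors.
    for left, right in zip(anchors, anchors[1:]):
        for i in range(left + 1, right):
            t = (i - left) / (right - left)
            decoded[i] = interpolate(decoded[left], decoded[right], t)
    return decoded


def mean_squared_error(frames, decoded):
    """Average squared error over all components of all frames."""
    total = sum((x - y) ** 2
                for f, d in zip(frames, decoded)
                for x, y in zip(f, d))
    count = sum(len(f) for f in frames)
    return total / count
```

In this toy setting, only the anchor frames cost bits, so widening `spacing` lowers the rate while raising the interpolation error. Measuring that trade-off is what an operational rate-distortion study does.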
