Abstract

The human vocal tract system is commonly modeled by a linear predictive coding (LPC) filter whose coefficients are transformed into a line spectral frequency (LSF) vector for quantization. Predictive split-vector quantization (PSVQ) based on an auto-regressive model (AR-PSVQ), which exploits the inter-frame correlation of the LSF vectors, provides a better rate-distortion performance compared with quantization methods that only consider the intra-frame correlation. In the proposed conditional PSVQ (C-PSVQ), the conditional distribution of the current-frame LSF given the previous-frame LSF is taken into account. Compared with AR-PSVQ, C-PSVQ gains 1 bit in terms of average spectral distortion and 2 bits in terms of the number of outlier frames. Memory requirements and computational complexity of C-PSVQ are similar to those of AR-PSVQ.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.