Code Excited Linear Prediction Research Articles

Speech coders operating at low bit rates necessitate efficient encoding of the linear predictive coding (LPC) coefficients. Line spectral frequencies (LSF) parameters are currently one of the most efficient choices of transmission parameters for the LPC coefficients. In this paper, an optimized trellis coded vector quantization (TCVQ) scheme for encoding the LSF parameters is developed. When the selection of a proper distortion measure is the most important issue in the design and operation of the encoder, an appropriate weighted distance measure has been used during the TCVQ construction process. Using this distance, we will show that the LSF TCVQ encoder performs better than the encoder conceived with the unweighted distance and a reduction of about 1–2 bits/frame is obtained while maintaining the same performance. We further applied the TCVQ encoder system for encoding the LSF parameters of the US federal standard (FS1016) 4.8 kbps code excited linear prediction (CELP) speech coder. At lower bit rates, our objective and subjective evaluation results show that the incorporated LSF TCVQ encoder performs better than the 34 bits/frame LSF scalar quantizer used originally in the FS1016 coder. The subjective tests reveal also that the 27 bits/frame scheme produces equivalent perceptual quality to that when the LSF parameters are unquantized.

Read full abstract

This paper presents several strategies to improve the performance of very low bit rate speech coders and describes a speech codec that incorporates these strategies and operates at an average bit rate of 1.2 kb/s. The encoding algorithm is based on several improvements in a mixed multiband excitation (MMBE) linear predictive coding (LPC) structure. A switched-predictive vector quantiser technique that outperforms previously reported schemes is adopted to encode the LSF parameters. Spectral and sound specific low rate models are used in order to achieve high quality speech at low rates. An MMBE approach with three sub-bands is employed to encode voiced frames, while fricatives and stops modelling and synthesis techniques are used for unvoiced frames. This strategy is shown to provide good quality synthesised speech, at a bit rate of only 0.4 kb/s for unvoiced frames. To reduce coding noise and improve decoded speech, spectral envelope restoration combined with noise reduction (SERNR) postfilter is used. The contributions of the techniques described in this paper are separately assessed and then combined in the design of a low bit rate codec that is evaluated against the North American Mixed Excitation Linear Prediction (MELP) coder. The performance assessment is carried out in terms of the spectral distortion of LSF quantisation, mean opinion score (MOS), A/B comparison tests and the ITU-T P.862 perceptual evaluation of speech quality (PESQ) standard. Assessment results show that the improved methods for LSF quantisation, sound specific modelling and synthesis and the new postfiltering approach can significantly outperform previously reported techniques. Further results also indicate that a system combining the proposed improvements and operating at 1.2 kb/s, is comparable (slightly outperforming) a MELP coder operating at 2.4 kb/s. For tandem connection situations, the proposed system is clearly superior to the MELP coder.

Read full abstract

Code Excited Linear Prediction Research Articles

Related Topics

Articles published on Code Excited Linear Prediction

Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility

Bit rate reduction of mixed excitation linear prediction coder by Lempel-Ziv segment quantization

Encoding and decoding method and apparatus using rising-transition detection and notification

Switching Search Method for Pulse Assignment in ITU-T G.729D

Decoder Initializing Technique for Improving Frame-Erasure Resilience of a CELP Speech Codec

Codebook Based Digital Speech Compression

Fast Recovery for a CELP-Like Speech Codec After a Frame Erasure

Soft Reconstruction of Speech in the Presence of Noise and Packet Loss

Compression of surface EMG signals with algebraic code excited linear prediction

Efficient algebraic code-excited linear predictive codebook search

Multiband Vector Quantization Based on Inner Product for Wideband Speech Coding

Signal modification method for variable bit rate wide-band speech coding

Optimized trellis coded vector quantization of LSF parameters, application to the 4.8 kbps FS1016 speech coder

Voice Activity Detection Algorithm Based on Radial Basis Function Network

Watermarking Combined with CELP Speech Coding for Authentication

Efficient two-stage vector quantization speech coder using wavelet coefficients of excitation signals

Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec

A Noise Reduction Preprocessor for Mobile Voice Communication

KLT-Based Adaptive Classified VQ of the Speech Signal

The 2003 Benjamin Franklin medal in electrical engineering presented to Bishnu S. Atal

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Code Excited Linear Prediction Research Articles

Related Topics

Articles published on Code Excited Linear Prediction

Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility

Bit rate reduction of mixed excitation linear prediction coder by Lempel-Ziv segment quantization

Encoding and decoding method and apparatus using rising-transition detection and notification

Switching Search Method for Pulse Assignment in ITU-T G.729D

Decoder Initializing Technique for Improving Frame-Erasure Resilience of a CELP Speech Codec

Codebook Based Digital Speech Compression

Fast Recovery for a CELP-Like Speech Codec After a Frame Erasure

Soft Reconstruction of Speech in the Presence of Noise and Packet Loss

Compression of surface EMG signals with algebraic code excited linear prediction

Efficient algebraic code-excited linear predictive codebook search

Multiband Vector Quantization Based on Inner Product for Wideband Speech Coding

Signal modification method for variable bit rate wide-band speech coding

Optimized trellis coded vector quantization of LSF parameters, application to the 4.8 kbps FS1016 speech coder

Voice Activity Detection Algorithm Based on Radial Basis Function Network

Watermarking Combined with CELP Speech Coding for Authentication

Efficient two-stage vector quantization speech coder using wavelet coefficients of excitation signals

Strategies to improve the performance of very low bit rate speech coders and application to a variable rate 1.2 kb/s codec

A Noise Reduction Preprocessor for Mobile Voice Communication

KLT-Based Adaptive Classified VQ of the Speech Signal

The 2003 Benjamin Franklin medal in electrical engineering presented to Bishnu S. Atal