Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation

U Bhaskar,K Swaminathan,S Nandkumar

doi:10.1109/scft.2000.878376

Abstract

The 4.0 kbit/s speech codec described is based on a frequency domain interpolative (FDI) coding technique, which belongs to the class of prototype waveform interpolation (PWI) coding techniques. The codec also has an integrated voice activity detector (VAD) and a noise reduction capability. The input signal is subjected to LPC analysis and the prediction residual is separated into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) component. The SEW magnitude component is quantized using a hierarchical predictive vector quantization approach. The REW magnitude is quantized using a gain and a sub-band based shape. The SEW and REW phases are derived at the decoder using a phase model, based on a transmitted measure of voice periodicity. The spectral (LSP) parameters are quantized using a combination of scalar and vector quantizers. The 4.0 kbits/s coder has an algorithmic delay of 60 ms and an estimated floating point complexity of 21.5 MIPS. The performance of this coder has been evaluated using in-house MOS tests under various conditions such as background noise, channel errors, self-tandem, and DTX mode of operation, and has been shown to be statistically equivalent to ITU-T G.729 8 kbps codec across all conditions tested.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Quantization of SEW and REW components for 3.6 kbit/s coding based on PWI
U Bhaskar ... G Zakaria
-
U Bhaskar, et. al.U Bhaskar ... G Zakaria
20 Jun 1999
20 Jun 1999

Low bit-rate voice compression based on frequency domain interpolative techniques
U Bhaskar ... K Swaminathan
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14
U Bhaskar, et. al.U Bhaskar ... K Swaminathan
01 Mar 2006
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14

SEW representation for low rate WI coding
J Lukasiak ... I.S Burnett
-
J Lukasiak, et. al.J Lukasiak ... I.S Burnett
06 Feb 2006
06 Feb 2006

Phase adjustment in waveform interpolation
Hong-Goo Kang ... D Sen
-
Hong-Goo Kang, et. al. Hong-Goo Kang ... D Sen
01 Jan 1998
01 Jan 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation

Abstract

Talk to us

Similar Papers