Development of a 4 kbit/s hybrid sinusoidal/CELP speech coder

Ari Heikkinen

doi:10.1016/j.specom.2003.10.004

Abstract

A comprehensive performance analysis of sinusoidal and code-excited linear prediction (CELP) speech coding at bit rates around 4 kbit/s is presented, using both subjective and objective measurements. Based on these observations, a multi-modal hybrid coding approach that employs both sinusoidal and CELP coding is justified, and an implementation of such a coder is described. This 4 kbit/s sinusoidal/CELP speech coder classifies each input speech segment into one of four modes: voiced, jittery-voiced, plosive and unvoiced. Sinusoidal coding is used for voiced segments, whereas different CELP versions are employed for the other modes. Finally, the quality of the implemented coder in clean speech conditions is verified by a listening test. In the test, the 4 kbit/s coder performed almost as well as the high-quality references used, but it still needs improvement to be classified as a high-quality 4 kbit/s speech coder.

Full Text