Low Bit-Rate Speech Codec Based on a Long-Term Harmonic Plus Noise Model

Faten Ben Ali, Laurent Girin, Sonia Djaziri-Larbi

doi:10.17743/jaes.2016.0028

Abstract

The long-term harmonic plus noise model (LT-HNM) for speech offers attractive data compression because it exploits the smooth evolution of the time trajectories of the short-term harmonic plus noise model parameters by fitting a discrete cosine model (DCM) to them. In this paper, we extend the LT-HNM to a complete low bit-rate speech coder. A Normalized Split Vector Quantization (NSVQ) scheme is proposed to quantize the variable-dimension LT-DCM vectors. The NSVQ is designed according to the properties of the DCM vectors obtained from a standard speech database. The resulting LT-HNM coder reaches an average bit-rate of 2.7 kbps for wideband speech. The proposed coder is evaluated in terms of modeling and coding errors, bit-rate, listening quality, and intelligibility.

Index Terms: Low bit-rate, speech coding, long-term modeling, harmonic plus noise model, variable-dimension vector quantization.
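To illustrate the idea of modeling a parameter trajectory with a DCM, here is a minimal sketch, not the authors' implementation. It assumes a trajectory sampled on a uniform frame grid and a cosine basis of the DCT-II form cos(pπ(n + 0.5)/N); the exact basis, any weighting, and the order-selection rule used in the paper are not given in the abstract. The function names `dcm_fit` and `dcm_eval` are hypothetical. With this basis the cosines are orthogonal on the grid, so the least-squares coefficients reduce to simple projections.

```python
import math


def dcm_fit(trajectory, order):
    """Least-squares fit of a cosine model of the given order to a trajectory.

    Assumed basis: cos(p * pi * (n + 0.5) / N), p = 0..order, which is
    orthogonal on the uniform grid n = 0..N-1, so each coefficient is
    obtained by projecting the trajectory onto one basis function.
    """
    n_frames = len(trajectory)
    if not 0 <= order < n_frames:
        raise ValueError("order must satisfy 0 <= order < len(trajectory)")
    coeffs = []
    for p in range(order + 1):
        projection = sum(
            x * math.cos(p * math.pi * (n + 0.5) / n_frames)
            for n, x in enumerate(trajectory)
        )
        # Squared norm of the p-th basis function on the grid.
        norm = n_frames if p == 0 else n_frames / 2.0
        coeffs.append(projection / norm)
    return coeffs


def dcm_eval(coeffs, n_frames):
    """Rebuild an n_frames-long trajectory from the model coefficients."""
    return [
        sum(
            c * math.cos(p * math.pi * (n + 0.5) / n_frames)
            for p, c in enumerate(coeffs)
        )
        for n in range(n_frames)
    ]


# Illustrative use: a smooth, made-up 20-frame F0 trajectory in Hz.
f0 = [120.0 + 10.0 * math.cos(math.pi * (n + 0.5) / 20) for n in range(20)]
coeffs = dcm_fit(f0, order=3)      # 4 coefficients stand in for 20 samples
f0_hat = dcm_eval(coeffs, len(f0))
```

In this sketch the compression comes from storing `order + 1` coefficients instead of one value per frame. Because the order can differ from one speech segment to the next, the coefficient vectors have variable dimension, which is the situation the quantizer in the abstract addresses.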

Journal: Journal of the Audio Engineering Society	Publication Date: Dec 1, 2016
Citations: 3	License type: other-oa

Similar Papers

A novel approach to variable dimension vector quantization of harmonic magnitudes
W.C Chu
18 Sep 2003

Variable-dimension vector quantization
A Das ... A.V Rao
IEEE Signal Processing Letters | VOL. 3
01 Jul 1996

Variable dimension spectral coding of speech at 2400 bps and below with phonetic classification
A Das ... A Gersho
09 May 1995

A long term harmonic plus noise model for narrow-band speech coding at very low bit-rates
Faten Ben Ali ... Sonia Djaziri-Larbi
01 Jul 2017
