Vector quantization of LSF parameters with a mixture of dirichlet distributions

Zhanyu Ma,Arne Leijon,W Bastiaan Kleijn

doi:10.1109/tasl.2013.2238732

Abstract

Quantization of the linear predictive coding parameters is an important part in speech coding. Probability density function (PDF)-optimized vector quantization (VQ) has been previously shown to be more efficient than VQ based only on training data. For data with bounded support, some well-defined bounded-support distributions (e.g., the Dirichlet distribution) have been proven to outperform the conventional Gaussian mixture model (GMM), with the same number of free parameters required to describe the model. When exploiting both the boundary and the order properties of the line spectral frequency (LSF) parameters, the distribution of LSF differences LSF can be modelled with a Dirichlet mixture model (DMM). We propose a corresponding DMM based VQ. The elements in a Dirichlet vector variable are highly mutually correlated. Motivated by the Dirichlet vector variable's neutrality property, a practical non-linear transformation scheme for the Dirichlet vector variable can be obtained. Similar to the Karhunen-Loeve transform for Gaussian variables, this non-linear transformation decomposes the Dirichlet vector variable into a set of independent beta-distributed variables. Using high rate quantization theory and by the entropy constraint, the optimal inter- and intra-component bit allocation strategies are proposed. In the implementation of scalar quantizers, we use the constrained-resolution coding to approximate the derived constrained-entropy coding. A practical coding scheme for DVQ is designed for the purpose of reducing the quantization error accumulation. The theoretical and practical quantization performance of DVQ is evaluated. Compared to the state-of-the-art GMM-based VQ and recently proposed beta mixture model (BMM) based VQ, DVQ performs better, with even fewer free parameters and lower computational cost

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Vector quantization of LSF parameters with a mixture of dirichlet distributions

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2011
Citations: 52

Similar Papers

Dirichlet mixture modeling to estimate an empirical lower bound for LSF quantization
Zhanyu Ma ... Jun Guo
Signal Processing | VOL. 104
Zhanyu Ma, et. al.Zhanyu Ma ... Jun Guo
30 Apr 2014
Signal Processing | VOL. 104

PDF-optimized LSF vector quantization based on beta mixture models
Zhanyu Ma ... Arne Leijon
-
Zhanyu Ma, et. al.Zhanyu Ma ... Arne Leijon
26 Sep 2010
26 Sep 2010

Modelling speech line spectral frequencies with dirichlet mixture models
Zhanyu Ma ... Arne Leijon
-
Zhanyu Ma, et. al.Zhanyu Ma ... Arne Leijon
26 Sep 2010
26 Sep 2010

A Kalman filtering approach to GMM predictive coding of LSFS for packet loss conditions
Shaminda Subasingha ... Manohar N Murthiy
-
Shaminda Subasingha, et. al.Shaminda Subasingha ... Manohar N Murthiy
01 Jul 2009
01 Jul 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Vector quantization of LSF parameters with a mixture of dirichlet distributions

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing