Enhanced 2.4 kb/s mixed excitation linear prediction vocoder

Song Du Song Du,Huijuan Cui Huijuan Cui

doi:10.1109/icct.1998.741016

Abstract

We describe the improvements of a voice codec that has been issued as a draft for the US Federal Information Processing Standards-analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction (MELP) on June 12, 1997. In pitch estimation in MELP, a pitch doubling check algorithm and a strong voice pitch smoothing algorithm are applied. However, these algorithms are too simple to compute an accurate and smooth pitch period, and a leap of the pitch happens sometimes, especially during voice transition, about 5% to 10% of the pitch estimates are still not correct. In order to obtain a more accurate and smooth pitch period, a dynamic frame relative smoothing algorithm is applied to optimize the pitch period in MELP. After pitch smoothing almost all the errors are eliminated. In order to fit Chinese, we retrain the prediction parameters codebook for MELP using the simulated annealing algorithm based on a Chinese voice database. The Itakura distance test of distortion is applied, which shows the codebook obtained by the simulated annealing algorithm has less distortion than the codebook obtained by the traditional LBG algorithm. The probability of distortion of the former is 15% greater than the latter, for an Itakura distance between -0.1 and 0.1. The enhanced algorithm gives a better codebook for more fluent Chinese synthetic speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhanced 2.4 kb/s mixed excitation linear prediction vocoder

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An improved mixed excitation linear prediction (MELP) coder
T Unno ... T.P Barnwell
-
T Unno, et. al.T Unno ... T.P Barnwell
01 Jan 1998
01 Jan 1998

Study and development of MELP vocoder
Wenyuan Li ... Yuzhong Zhang
-
Wenyuan Li, et. al. Wenyuan Li ... Yuzhong Zhang
26 Aug 2002
26 Aug 2002

A new 1.2 kb/s speech coding algorithm and its real-time implementation on TMS320LC548
Chen Liang ... Zhang Xiongwei
-
Chen Liang, et. al. Chen Liang ... Zhang Xiongwei
21 Aug 2000
A new 1.2 kb/s speech coding algorithm and its real-time implementation on TMS320LC548
Chen Liang ... Zhang Xiongwei

An extended Levinson-Durbin algorithm and its application in mixed excitation linear prediction
Dong Xiao ... Li Ma
Heliyon | VOL. 4
Dong Xiao, et. al.Dong Xiao ... Li Ma
01 Nov 2018
Heliyon | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhanced 2.4 kb/s mixed excitation linear prediction vocoder

Abstract

Talk to us

Similar Papers