Perceptually Weighted Analysis-by-Synthesis Vector Quantization for Low Bit Rate MFCC Codec

Gang Min,Jibin Yang,Xia Zou,Xiongwei Zhang

doi:10.1109/lsp.2016.2598226

Abstract

This letter presents a perceptually weighted analysis-by-synthesis vector quantization (VQ) algorithm for low bit rate MFCC codec. Different from conventional VQ of mel-frequency cepstral coefficients (MFCCs) vector, this algorithm uses an analysis-by-synthesis technique and aims to minimize the perceptually weighted spectral reconstruction distortion rather than the distortion of MFCCs vector itself. Also, to reduce the computational complexity, we propose a practical suboptimal codebook searching technique and embed it into the split and multistage VQ framework. Objective and subjective experimental results on Mandarin speech show that the proposed algorithm yields intelligible and natural sounding speech for speech coding at 600–2400 bit/s. Compared to current VQ in MFCC codec, the output speech quality is substantially improved in terms of frequency-weighted segmental SNR, short-time objective intelligibility score, perceptual evaluation of speech quality score, and mean opinion score.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Perceptually Weighted Analysis-by-Synthesis Vector Quantization for Low Bit Rate MFCC Codec

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Journal: IEEE Signal Processing Letters	Publication Date: Oct 1, 2016
Citations: 14

Similar Papers

Pitch prediction from Mel-frequency cepstral coefficients using sparse spectrum recovery
M V Achuth Rao ... Prasanta Kumar Ghosh
-
M V Achuth Rao, et. al.M V Achuth Rao ... Prasanta Kumar Ghosh
01 Mar 2017
01 Mar 2017

Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures
Jonathan Darch ... Saeed Vaseghi
The Journal of the Acoustical Society of America | VOL. 124
Jonathan Darch, et. al.Jonathan Darch ... Saeed Vaseghi
01 Dec 2008
The Journal of the Acoustical Society of America | VOL. 124

Pornographic Audios Detection Using MFCC Features and Vector Quantization
Zhiyi Qu ... Jing Yu
-
Zhiyi Qu, et. al.Zhiyi Qu ... Jing Yu
01 Dec 2010
01 Dec 2010

Frogs Sound Detection to Control The Population of Frogs as Pests Using Mel-frequency Cepstral Coefficient-Vector Quantization (MFCC-VQ) Algorithm
Romi Fadillah Rahmat ... Erna Budhiarti Nababan
-
Romi Fadillah Rahmat, et. al.Romi Fadillah Rahmat ... Erna Budhiarti Nababan
03 Sep 2020
03 Sep 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Perceptually Weighted Analysis-by-Synthesis Vector Quantization for Low Bit Rate MFCC Codec

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters