A MFCC-Based CELP Speech Coder for Server-Based Speech Recognition in Network Environments

J S Yoon, G H Lee, H K Kim

doi:10.1093/ietfec/e90-a.3.626

Abstract

Existing standard speech coders can provide high-quality speech communication. However, they tend to degrade the performance of automatic speech recognition (ASR) systems that use the reconstructed speech. The main cause of the degradation is that the linear predictive coefficients (LPCs), the typical spectral envelope parameters in speech coding, are optimized for speech quality rather than for speech recognition performance. In this paper, we propose a speech coder that uses mel-frequency cepstral coefficients (MFCCs) instead of LPCs to improve the performance of a server-based speech recognition system in network environments. To develop the proposed speech coder at a low bit rate, we first explore the interframe correlation of MFCCs, which leads to predictive quantization of the MFCCs. Second, a safety-net scheme is proposed to make the MFCC-based speech coder robust to channel errors. As a result, we propose an 8.7 kbps MFCC-based CELP coder. The proposed speech coder is shown to have speech quality comparable to that of 8 kbps G.729, and the ASR system using the proposed speech coder achieves a relative word error rate reduction of 6.8% compared to the ASR system using G.729 on a large-vocabulary task (AURORA4).
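The combination of predictive quantization and a safety net mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's quantizer: the abstract does not give the design, so the scalar quantizers, the prediction coefficient `RHO`, the bit allocation `BITS`, and the two ranges below are all assumptions chosen for illustration. Each frame is coded twice, once as a residual against a prediction from the previous decoded frame and once directly (the safety net), and the mode with the smaller error is kept.

```python
import numpy as np

# All constants are illustrative assumptions, not values from the paper.
BITS = 4               # bits per coefficient
RHO = 0.8              # first-order interframe prediction coefficient
RESIDUAL_RANGE = 2.0   # predictive mode: fine steps over small residuals
DIRECT_RANGE = 16.0    # safety-net mode: coarse steps over the full range


def quantize(x, max_abs):
    """Map x to indices of a uniform quantizer over [-max_abs, max_abs]."""
    levels = 2 ** BITS
    step = 2.0 * max_abs / (levels - 1)
    return np.clip(np.round((x + max_abs) / step), 0, levels - 1).astype(int)


def dequantize(idx, max_abs):
    """Map quantizer indices back to reconstruction values."""
    levels = 2 ** BITS
    step = 2.0 * max_abs / (levels - 1)
    return idx * step - max_abs


def decode_frame(mode, idx, prev_decoded):
    """Reconstruct one MFCC vector from its mode flag and indices."""
    if mode == "predictive":
        # Depends on the previous decoded frame, so channel errors propagate.
        return RHO * prev_decoded + dequantize(idx, RESIDUAL_RANGE)
    # Safety-net mode is memoryless: it ignores the previous frame entirely.
    return dequantize(idx, DIRECT_RANGE)


def encode_frame(mfcc, prev_decoded):
    """Code the frame in both modes and keep the one with lower squared error."""
    candidates = {
        "predictive": quantize(mfcc - RHO * prev_decoded, RESIDUAL_RANGE),
        "safety_net": quantize(mfcc, DIRECT_RANGE),
    }

    def error(mode):
        decoded = decode_frame(mode, candidates[mode], prev_decoded)
        return float(np.sum((mfcc - decoded) ** 2))

    mode = min(candidates, key=error)
    return mode, candidates[mode]
```

In this sketch a frame that follows the previous one closely is coded in predictive mode, while an abrupt change falls back to the safety net. Because a safety-net frame is decoded without reference to earlier frames, it also stops an earlier channel error from propagating further, which is the robustness property the abstract refers to.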

Journal: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: Mar 1, 2007
Citations: 9

Similar Papers

Design of a speech coder utilizing speech recognition parameters for server-based wireless speech recognition
Gil Ho Lee ... Yoo Rhee Oh
18 Nov 2004

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
20 May 2008

Using DTW neural–based MFCC warping to improve emotional speech recognition
Mansour Sheikhan ... Davood Gharavian
Neural Computing and Applications | VOL. 21
15 May 2011

Autocorrelation-based Methods for Noise-Robust Speech Recognition
Gholamreza Farahani ... Mohammad Mehdi
01 Jun 2007
