Compensating Acoustic Mismatch Using Class-Based Histogram Equalization for Robust Speech Recognition

Youngjoo Suh, Hoirin Kim, Sungtak Kim

doi:10.1155/2007/67870

Abstract

A new class-based histogram equalization method is proposed for robust speech recognition. The proposed method aims not only to compensate for the acoustic mismatch between training and test environments but also to reduce the two fundamental limitations of the conventional histogram equalization method: the discrepancy between the phonetic distributions of training and test speech data, and the nonmonotonic transformation caused by the acoustic mismatch. The algorithm employs multiple class-specific reference and test cumulative distribution functions, classifies noisy test features into their corresponding classes, and equalizes the features by using their corresponding class reference and test distributions. Minimum mean-square error log-spectral amplitude (MMSE-LSA)-based speech enhancement is added just prior to the baseline feature extraction to reduce the corruption by additive noise. Experiments on the Aurora2 database demonstrated the effectiveness of the proposed method, which reduced relative errors compared with both the mel-cepstral-based features and the conventional histogram equalization method.
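The class-based procedure described in the abstract can be illustrated with a minimal Python sketch for a single scalar feature component. It is only an illustration under stated assumptions, not the authors' implementation: the nearest-centroid classifier, the function names, and the `centroids` argument are all hypothetical, since this page does not say how features are assigned to classes.

```python
from bisect import bisect_right

def nearest_class(x, centroids):
    """Assumed classifier: index of the closest class centroid."""
    return min(range(len(centroids)), key=lambda k: abs(x - centroids[k]))

def class_based_heq(test, ref_by_class, centroids):
    """Equalize each test value with the CDFs of its own class.

    test         : list of scalar test features (one feature component)
    ref_by_class : list of per-class reference (training) samples
    centroids    : hypothetical class centroids used only for classification
    """
    labels = [nearest_class(x, centroids) for x in test]
    # Class-specific test distributions, stored as sorted samples.
    test_sorted = {k: sorted(x for x, c in zip(test, labels) if c == k)
                   for k in set(labels)}
    ref_sorted = [sorted(r) for r in ref_by_class]

    out = []
    for x, k in zip(test, labels):
        n, m = len(test_sorted[k]), len(ref_sorted[k])
        rank = bisect_right(test_sorted[k], x)   # 1..n, since x is in the class
        p = (rank - 0.5) / n                     # class test CDF value in (0, 1)
        idx = min(m - 1, int(p * m))             # inverse of class reference CDF
        out.append(ref_sorted[k][idx])
    return out

equalized = class_based_heq(
    [3.0, 4.0, 5.0, 13.0, 14.0, 15.0],          # test features, shifted by +3
    [[0.0, 1.0, 2.0], [10.0, 11.0, 12.0]],      # reference samples per class
    [1.0, 11.0],                                # assumed centroids
)
print(equalized)  # [0.0, 1.0, 2.0, 10.0, 11.0, 12.0]
```

Because each class is equalized against its own reference distribution, a difference in how often each class occurs in the test data does not distort the mapping of the other class.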

Highlights

The performance of automatic speech recognition (ASR) systems degrades severely when they are deployed in environments that are acoustically mismatched with the training conditions
As a feature-space compensation approach for robust speech recognition, the conventional histogram equalization (HEQ) technique can be used effectively to compensate for the acoustic mismatch between training and test environments
The conventional HEQ has two fundamental limitations: one caused by the mismatch of phonetic class distributions between training and test data, and one caused by the nonmonotonic transformation resulting from the acoustic mismatch
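As a rough illustration of the conventional HEQ mentioned in the highlights, the sketch below maps each test value through the test cumulative distribution and then through the inverse of the reference distribution, using order statistics. The function name `heq` and the sample data are hypothetical, and the order-statistic estimate of the CDF is one common choice rather than something stated on this page.

```python
from bisect import bisect_right

def heq(test, reference):
    """Conventional histogram equalization of one feature component.

    Each test value x is replaced by the reference value that has the
    same cumulative probability as x has in the test data.
    """
    test_sorted = sorted(test)
    ref_sorted = sorted(reference)
    n, m = len(test_sorted), len(ref_sorted)

    out = []
    for x in test:
        rank = bisect_right(test_sorted, x)   # number of test samples <= x
        p = (rank - 0.5) / n                  # test CDF value in (0, 1)
        idx = min(m - 1, int(p * m))          # inverse reference CDF lookup
        out.append(ref_sorted[idx])
    return out

reference = [0.0, 1.0, 2.0, 3.0]
clean = [3.0, 0.0, 2.0, 1.0]
noisy = [2.0 * v + 5.0 for v in clean]        # a monotonic distortion
print(heq(noisy, reference))  # [3.0, 0.0, 2.0, 1.0]
```

The example recovers the clean values only because the distortion is monotonic and the test data have the same distribution as the reference data. When either condition fails, the two limitations named in the highlights come into play.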

Summary

INTRODUCTION

The performance of automatic speech recognition (ASR) systems degrades severely when they are deployed in environments that are acoustically mismatched with the training conditions. The major feature-space approaches to handling the nonlinear behavior of the acoustic mismatch rely on piecewise linear approximation, such as the interacting multiple model (IMM) [6] and stereo-based piecewise linear compensation for environments (SPLICE) [7]. Because HEQ cannot compensate for the adverse effect caused by the temporally random behavior of noise, we introduce the minimum mean-square error log-spectral amplitude (MMSE-LSA)-based speech enhancement technique [19] as a front-end preprocessor to HEQ to further reduce the acoustic mismatch.
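For orientation, the gain function usually associated with MMSE-LSA enhancement can be written as follows. This is the form commonly given in the speech-enhancement literature and is not taken from this page; the symbols (a priori SNR \xi_k, a posteriori SNR \gamma_k, noisy spectral amplitude |Y_k|, enhanced amplitude \hat{A}_k, frequency-bin index k) are notation assumed here.

```latex
\hat{A}_k = G_{\mathrm{LSA}}(\xi_k, \gamma_k)\,|Y_k|,
\qquad
G_{\mathrm{LSA}}(\xi_k, \gamma_k)
  = \frac{\xi_k}{1+\xi_k}
    \exp\!\left( \frac{1}{2} \int_{v_k}^{\infty} \frac{e^{-t}}{t}\,dt \right),
\qquad
v_k = \frac{\xi_k}{1+\xi_k}\,\gamma_k
```

The enhanced amplitudes would then feed the baseline feature extraction, with HEQ applied to the resulting features.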

SPEECH ENHANCEMENT BASED ON MMSE-LSA

CONVENTIONAL HISTOGRAM EQUALIZATION

Basic algorithm

Class-tying technique

Speech database and feature extraction

Speech recognition results

CONCLUSION

Journal: EURASIP Journal on Advances in Signal Processing
Publication Date: Mar 22, 2007
Citations: 26
License type: cc-by

Similar Papers

Class-Based Histogram Equalization for Robust Speech Recognition
... Hoirin Kim
ETRI Journal | VOL. 28
08 Aug 2006

An adaptive contrast enhancement using regional dynamic histogram equalization
T Iwanami ... M Sakurai
01 Jan 2012

Survey of Contrast Enhancement Techniques based on Histogram Equalization
Manpreet Kaur ... Jasdeep Kaur
International Journal of Advanced Computer Science and Applications | VOL. 2
01 Jan 2010

Image Quality Analysis of a Novel Histogram Equalization Method for Image Contrast Enhancement
Fan-Chieh Cheng ... Shanq-Jang Ruan
IEICE Transactions on Information and Systems | VOL. E93-D
01 Jan 2009
