Robust speech recognition using harmonic features

Yeh Huann Goh,Sudhanshu Shekhar Jamuar,Paramesran Raveendran

doi:10.1049/iet-spr.2013.0094

Yeh Huann Goh, Sudhanshu Shekhar Jamuar + Show 1 more

https://doi.org/10.1049/iet-spr.2013.0094

Copy DOI

Journal: IET Signal Processing	Publication Date: Apr 1, 2014
Citations: 12

Affiliation: University of Malaya

Abstract

In this study, the authors propose a speech recognition system using harmonic structure related information to detect harmonic features in noisy environment. The proposed algorithm first extracts the harmonic components contained inside the speech signals using sine function convolution. By setting the frequency of the sine function as equal to the fundamental frequency of speech signals, harmonic components can be extracted out. The reconstructed signal obtained by summing up the extracted harmonic components is found to have a high degree of correlation with the original signal. The extracted frame energy measure of the harmonic components has been further processed to become dynamic harmonic features and then used together with the European Telecommunications Standards Institute (ETSI) front-end processed mel-frequency cepstral coefficients (MFCC) feature or the perceptual linear prediction (PLP) feature in the speech recognition system. The proposed enhanced speech recognition system shows a better recognition rate over the ETSI front-end processed MFCC (or PLP)-based speech recognition system.

Full Text