Using phase spectrum information for improved speech recognition performance

R Schluter, H Ney

doi:10.1109/icassp.2001.940785

Publication Date: May 7, 2001

Citations: 216

Affiliation: FH Aachen

#Mel Frequency Cepstral Coefficients #Standard Mel Frequency Cepstral Coefficients

Abstract

New acoustic features for continuous speech recognition based on the short-term Fourier phase spectrum are introduced for mono (telephone) recordings. The new phase-based features were combined with standard Mel Frequency Cepstral Coefficients (MFCC), and results were produced with and without additional linear discriminant analysis (LDA) to choose the most relevant features. Experiments were performed on the SieTill corpus of German digit strings recorded over telephone lines. Using LDA to combine purely phase-based features with MFCCs, we obtained improvements in word error rate of up to 25% relative to using MFCCs alone, with the same overall number of parameters in the system.
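As a rough illustration of the quantity the abstract refers to, the sketch below computes a short-term Fourier phase spectrum per frame using only the Python standard library. It is not the authors' feature extraction: the abstract does not say how the phase-based features are derived from the phase spectrum, and the function names, the Hamming window, the frame length, and the hop size are all assumptions chosen for illustration.

```python
import cmath
import math


def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames; an incomplete tail is dropped."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def hamming(n):
    """Symmetric Hamming window of length n (assumed window choice)."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * k / (n - 1)) for k in range(n)]


def dft(frame):
    """Naive O(n^2) discrete Fourier transform; returns bins 0..n//2."""
    n = len(frame)
    return [sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame))
            for k in range(n // 2 + 1)]


def short_term_spectra(samples, frame_len=32, hop=16):
    """Return (magnitudes, phases) per frame; phases are radians in [-pi, pi]."""
    window = hamming(frame_len)
    mags, phases = [], []
    for frame in frame_signal(samples, frame_len, hop):
        spectrum = dft([w * x for w, x in zip(window, frame)])
        mags.append([abs(c) for c in spectrum])
        phases.append([cmath.phase(c) for c in spectrum])
    return mags, phases


# Hypothetical test signal: a cosine with a period of 8 samples.
signal = [math.cos(2 * math.pi * t / 8) for t in range(64)]
mags, phases = short_term_spectra(signal)
```

MFCCs are computed from the magnitude spectrum only, so the `phases` output is the part of the short-term spectrum that a standard MFCC front end discards. In a setup like the one described, per-frame phase-derived and MFCC vectors would be joined and then reduced with LDA; that step is omitted here because the abstract gives no details.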

Full Text