A model of auditory perception as front end for automatic speech recognition.

Jürgen Tchorz,Birger Kollmeier

doi:10.1121/1.427950

Abstract

A front end for automatic speech recognizers is proposed and evaluated which is based on a quantitative model of the "effective" peripheral auditory processing. The model simulates both spectral and temporal properties of sound processing in the auditory system which were found in psychoacoustical and physiological experiments. The robustness of the auditory-based representation of speech was evaluated in speaker-independent, isolated word recognition experiments in different types of additive noise. The results show a higher robustness of the auditory front end in noise, compared to common mel-scale cepstral feature extraction. In a second set of experiments, different processing stages of the auditory front end were modified to study their contribution to robust speech signal representation in detail. The adaptive compression stage which enhances temporal changes of the input signal appeared to be the most important processing stage towards robust speech representation in noise. Low-pass filtering of the fast fluctuating envelope in each frequency band further reduces the influence of noise in the auditory-based representation of speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A model of auditory perception as front end for automatic speech recognition.

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: Oct 1, 1999
Citations: 116

Similar Papers

PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception
R Huber ... B Kollmeier
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14
R Huber, et. al.R Huber ... B Kollmeier
01 Nov 2006
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14

Auditory masking based acoustic front-end for robust speech recognition
K.K Paliwal ... B.T Lilly
-
K.K Paliwal, et. al.K.K Paliwal ... B.T Lilly
02 Dec 1997
02 Dec 1997

Combining speech enhancement and auditory feature extraction for robust speech recognition
Michael Kleinschmidt ... Birger Kollmeier
Speech Communication | VOL. 34
Michael Kleinschmidt, et. al.Michael Kleinschmidt ... Birger Kollmeier
14 Feb 2001
Speech Communication | VOL. 34

On the Use of a Robust Speech Representation
Jean-Claude Junqua ... Jean-Paul Haton
-
Jean-Claude Junqua, et. al.Jean-Claude Junqua ... Jean-Paul Haton
01 Jan 1996
01 Jan 1996

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A model of auditory perception as front end for automatic speech recognition.

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America