Time-interval information in the auditory representation of speech sounds

Roy D Patterson

doi:10.1121/1.424756

Abstract

In the auditory system, the primary fibers that encode mechanical motion of the basilar partition are phase-locked to that motion, and this information is preserved, to varying degrees, up to the inferior colliculus. It is known that this timing-interval information is used in localization, and it is probably also used to separate sources from diffuse background noise. The time intervals are on the order of milliseconds, and so traditional speech preprocessors (like MFCC systems) with frames on the order of 15 ms, remove the time-interval information from the representation. The performance of these systems deteriorates badly when the speaker is in a noisy environment. This suggests that time-interval processing will eventually need to be integrated into speech recognition systems if they are to achieve the kind of noise resistance characteristic of human speech recognition. An auditory image model (AIM) will be presented that is designed to stabilize repeating time-interval patterns like those produced by voiced speech, and results from experiments where AIM has been used as a preprocessor for automatic speech recognition. [Work supported by UK MRC (G9703469).]

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Time-interval information in the auditory representation of speech sounds

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: Feb 1, 1999
Citations: 1

Similar Papers

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
Ronan Flynn, et. al.Ronan Flynn ... Edward Jones
20 May 2008
Speech Communication | VOL. 50

On using the auditory image model and invariant-integration for noise robust automatic speech recognition
Florian Muller ... Alfred Mertins
-
Florian Muller, et. al.Florian Muller ... Alfred Mertins
01 Mar 2012
01 Mar 2012

Auditory models and nonlinear filterbanks in underwater auralization
Stefan Bleeck ... Paul R White
The Journal of the Acoustical Society of America | VOL. 123
Stefan Bleeck, et. al.Stefan Bleeck ... Paul R White
01 May 2008
The Journal of the Acoustical Society of America | VOL. 123

Power Spectrum Difference Teager Energy Features for Speech Recognition in Noisy Environment
N S Nehe ... R.S Holambe
-
N S Nehe, et. al.N S Nehe ... R.S Holambe
01 Dec 2008
01 Dec 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Time-interval information in the auditory representation of speech sounds

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America