Abstract

Auditory models might be applied to speech coding either as objective evaluation measures, or as coder stages in coder/decoder systems. Both applications are based on the premise that these models accurately represent all the information that is relevant to auditory perception. In this talk, two fundamental problems that need to be worked out in order to make auditory models practically useful in speech coding are identified: the choice of a distance measure for comparing model outputs, and the design of the decoder. These ideas are illustrated using a model of peripheral auditory processing that is based on physiological recordings from auditory-nerve fibers, and predicts psychophysical performance in masking and discrimination for a variety of stimulus conditions. In this model, intensity difference limens in restricted regions of the frequency spectrum depend on overall spectral shape and sound level, so that the coding accuracy in different frequency bands would have to vary with time. Model outputs will be compared for natural and coded speech, and the role of nonlinear phenomena in the inner ear (such as two-tone suppression ) will be discussed. [Work supported by NIH Grant NS13126.]

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.