Abstract

Frequency modulation (FM) is a basic constituent of vocalisation in many animals as well as in humans. In human speech, short rising and falling FM-sweeps of around 50 ms duration, called formant transitions, characterise individual speech sounds. There are two representations of FM in the ascending auditory pathway: a spectral representation, holding the instantaneous frequency of the stimuli; and a sweep representation, consisting of neurons that respond selectively to FM direction. To-date computational models use feedforward mechanisms to explain FM encoding. However, from neuroanatomy we know that there are massive feedback projections in the auditory pathway. Here, we found that a classical FM-sweep perceptual effect, the sweep pitch shift, cannot be explained by standard feedforward processing models. We hypothesised that the sweep pitch shift is caused by a predictive feedback mechanism. To test this hypothesis, we developed a novel model of FM encoding incorporating a predictive interaction between the sweep and the spectral representation. The model was designed to encode sweeps of the duration, modulation rate, and modulation shape of formant transitions. It fully accounted for experimental data that we acquired in a perceptual experiment with human participants as well as previously published experimental results. We also designed a new class of stimuli for a second perceptual experiment to further validate the model. Combined, our results indicate that predictive interaction between the frequency encoding and direction encoding neural representations plays an important role in the neural processing of FM. In the brain, this mechanism is likely to occur at early stages of the processing hierarchy.

Highlights

  • Frequency modulation (FM) is a basic acoustic feature of animal vocalisation, human speech, and music

  • The feedback mechanism proposed in this work uses predictions generated by populations encoding FM direction to aid encoding in populations encoding instantaneous frequency, enhancing direction selectivity and shortening FM processing time

  • In this work we have harnessed a well-established perceptual phenomenon to inform a model of FM direction encoding

Read more

Summary

Introduction

Frequency modulation (FM) is a basic acoustic feature of animal vocalisation, human speech, and music. Individual neurons at higher levels of the processing hierarchy (inferior colliculus [3,4,5], medial geniculate body [6, 7], and auditory cortex [8,9,10,11]) encode FM direction and FM rate, by responding selectively to certain rates and direction. We call this latter, more abstract representation, the sweep representation. The aim of the present study was to develop such a model, with a focus on FM-sweeps of the duration and frequency span of formant transitions

Objectives
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.