Abstract

Theories of prevocalic stop consonant perception differ with respect to the amount of spectral resolution required by the model, e.g., gross spectral shape theories predict that human speech perception is more robust against decreased frequency selectivity where detailed cues such as vocalic formant transitions break down. Previous results in the perception of noise vocoded speech [M. Kiefte, Int. Conf. Phon. Sci. (1999)] showed that, although listeners’ performance worsened significantly at bandwidths as low as 500 Hz, a spectral shape model using cepstral coefficients was better able to predict subjects’ responses. However, a simulation which showed that release bursts require much less spectral resolution than vocalic formant transitions complicates the interpretation. Therefore, in this experiment, subjects were asked to identify the place of articulation for gated release bursts independently of vocalic portions for both unprocessed and noise channel vocoded speech. Preliminary results show that correct identification of stop place from only the release burst does not change significantly up to a channel bandwidth of 2000 Hz. Results suggest that responses from the previous experiment can be explained by the robustness of the burst in those stimuli. Gross versus detailed cue models are also evaluated with respect to responses to formant only stimuli. [Work supported by SSHRC.]

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.