Abstract

Listeners are highly proficient at adapting to contextual variation when perceiving speech. In the present study, we examined the effects of brief speech and nonspeech contexts on the perception of sibilant fricatives. We explored three theoretically motivated accounts of contextual adaptation, based on phonetic cue calibration, phonetic covariation, and auditory contrast. Under the cue calibration account, listeners adapt by estimating a talker-specific average for each phonetic cue or dimension; under the cue covariation account, listeners adapt by exploiting consistencies in how the realization of speech sounds varies across talkers; under the auditory contrast account, adaptation results from (partial) masking of spectral components that are shared by adjacent stimuli. The spectral center of gravity, a phonetic cue to fricative identity, was manipulated for several types of context sound: /z/-initial syllables, /v/-initial syllables, and white noise matched in long-term average spectrum (LTAS) to the /z/-initial stimuli. Listeners’ perception of the /s/–/ʃ/ contrast was significantly influenced by /z/-initial syllables and LTAS-matched white noise stimuli, but not by /v/-initial syllables. No significant difference in adaptation was observed between exposure to /z/-initial syllables and matched white noise stimuli, and speech did not have a considerable advantage over noise when the two were presented consecutively within a context. The pattern of findings is most consistent with the auditory contrast account of short-term perceptual adaptation. The cue covariation account makes accurate predictions for speech contexts, but not for nonspeech contexts or for the absence of a speech-versus-nonspeech difference.

Highlights

  • The realization of a speech sound can vary substantially according to contextual factors such as neighboring speech sounds, global factors like speaking rate, and talker characteristics, including physiology, dialect, and idiosyncratic factors (e.g., Byrd, 1992; Delattre, Liberman, & Cooper, 1955; Johnson & Beckman, 1997; Liberman, Cooper, Shankweiler, & Studdert-Kennedy, 1967; Miller, Green, & Reeves, 1986; Nolan, 1983; Peterson & Barney, 1952)

  • In the present study we investigated the mechanisms behind short-term and generalized adaptation to talker-specific spectral properties in fricative consonants

  • The findings revealed that auditory contrast best accounted for the observed adaptation effects when the exposure and test stimuli were adjacent to one another, with a relatively short amount of intervening time (< 2 s)

Read more

Summary

Introduction

The realization of a speech sound can vary substantially according to contextual factors such as neighboring speech sounds, global factors like speaking rate, and talker characteristics, including physiology, dialect, and idiosyncratic factors (e.g., Byrd, 1992; Delattre, Liberman, & Cooper, 1955; Johnson & Beckman, 1997; Liberman, Cooper, Shankweiler, & Studdert-Kennedy, 1967; Miller, Green, & Reeves, 1986; Nolan, 1983; Peterson & Barney, 1952). We examined the effects of brief speech and nonspeech contexts on the perception of sibilant fricatives Fricatives such as /s/ vary in their spectral properties across phonetic contexts (e.g., Jongman, Wayland, & Wong, 2000; Soli, 1981) and different talkers or talker groups (e.g., Flipsen, Shriberg, Weismer, Karlsson, & McSweeny, 1999; Jongman et al, 2000; Newman, Clouse, & Burnham, 2001). Listeners adapt to such context- and talker-specific realizations. Listeners shift the perceptual boundary between /s/ and /f/ according to whether a talker produced

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call