Abstract

A male and a female talker each produced a set of /bVb/ syllables (V = /i,I,e,ε,æ,a,ʌ,ɔ,o,ʋ,u/) at metronome‐controlled rates. The central 60% of the voiced regions of these syllables was edited out, producing single‐talker silent‐center syllables [cf. W. Strange, J. J. Jenkins, and T. R. Edman, J. Acoust. Soc. Am. 61, S39 (1977)]. Comparable “hybrid” silent‐center syllables were created by appending the final 20% of one talker's syllable to the initial 20% of the other talker's production of the same syllable, again with a 60% silent region intervening. Vowel identification errors were relatively low for both the single‐talker (22.4%) and the hybrid (27.0%) silent‐center syllables, though errors in each case exceeded those found for the original (unedited) syllables, though errors in each case exceeded those found for the original (unedited) syllables (9.3%). Hybrids were generally heard as coherent syllables from a single source; the error pattern for the hybrids could not be interpreted as a joint function of errors on the initial pieces (55.4% on average) and the final pieces (72.9%). Results indicate that considerable information for vowels is contained in the dynamic regions early and late in a syllable. Moreover, they suggest that this information does not simply subserve computation of an underlying normalized target. The information may specify properties of production dynamics that are invarient for a vowel for all talkers of a dialect. To the extent that such information is available, normalization in perception is unnecessary. [Research supported by NICHD.]

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call