A theoretical framework for the segmental component of speech production is outlined and some preliminary supporting data are reviewed. According to the framework, articulatory movements are programmed to achieve sequences of goals that are defined in terms of articulatory and acoustic parameters. The goals are correlates of distinctive features. Some feature correlates are determined by quantal (non-linear) relations between articulation and sound. Goals may also be influenced by other principles, such as a compromise between sufficient perceptual contrast and economy of articulatory effort, which leads to the prediction that the goal definitions correspond to regions (as opposed to points) in acoustic and articulatory space. Thus the goals are characterized by some parameter variation, which is possible partly because listeners can understand variable speech. Before utterances are produced, goal specifications are modified by prosodic influences and reduction. The sequence of modified goal specifications is converted to smooth, appropriately-timed articulatory movements by the speech motor control system. This control and the resulting kinematics are constrained in part by the biomechanical properties of the articulators. To help keep acoustic variability within perceptually-acceptable limits, speech motor control mechanisms may include a strategy by which different parts of the vocal-tract area function are adjusted in a complementary (“motor equivalent”) manner. The strategy takes advantage of the fact that for some sounds, a similar acoustic transfer function can be achieved with somewhat different area functions. The existence of such a strategy and the idea that speech motor programming is based in part on acoustic goals are supported by data that show trading relations between lip rounding and tongue-body raising in production of the vowel /u/.
Read full abstract