The effect of rhythm on inter-gestural coupling of onset and vowel gestures and predictive timing in stuttering

  • Abstract
  • References
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

The effect of rhythm on inter-gestural coupling of onset and vowel gestures and predictive timing in stuttering

ReferencesShowing 10 of 89 papers
  • Open Access Icon
  • Cite Count Icon 19
  • 10.1515/lp-2014-0014
Timing of German onset and word boundary clusters
  • Jan 1, 2014
  • Laboratory Phonology
  • Jana Brunner + 3 more

  • Cite Count Icon 35
  • 10.1016/s0094-730x(96)00016-2
Analysis of F2 transitions in the speech of stutterers and nonstutterers
  • Feb 1, 1997
  • Journal of Fluency Disorders
  • Michael Robb + 1 more

  • Cite Count Icon 96
  • 10.1016/j.jfludis.2007.11.005
Investigating speech motor practice and learning in people who stutter
  • Nov 29, 2007
  • Journal of Fluency Disorders
  • Aravind Kumar Namasivayam + 1 more

  • Open Access Icon
  • Cite Count Icon 58
  • 10.1111/desc.12521
Auditory-motor adaptation is reduced in adults who stutter but not in children who stutter.
  • Mar 2, 2017
  • Developmental Science
  • Ayoub Daliri + 4 more

  • Open Access Icon
  • Cite Count Icon 22
  • 10.1044/2018_jslhr-s-17-0435
The Effects of Syntactic Complexity and Sentence Length on the Speech Motor Control of School-Age Children Who Stutter.
  • Sep 19, 2018
  • Journal of Speech, Language, and Hearing Research
  • Evan R Usler + 1 more

  • Cite Count Icon 180
  • 10.1044/jshr.2301.122
Stuttering
  • Mar 1, 1980
  • Journal of Speech, Language, and Hearing Research
  • Gerald Zimmermann

  • Cite Count Icon 1
  • 10.1016/j.jcomdis.2022.106194
Speech kinematic variability in adults who stutter is influenced by treatment and speaking style
  • Feb 2, 2022
  • Journal of Communication Disorders
  • Torrey M Loucks + 3 more

  • Cite Count Icon 18
  • 10.1044/jslhr.4104.802
Stuttering, stressed syllables, and word onsets.
  • Aug 1, 1998
  • Journal of Speech, Language, and Hearing Research
  • Carol P Hubbard

  • Open Access Icon
  • Cite Count Icon 233
  • 10.1016/j.wocn.2018.03.002
Analyzing dynamic phonetic data using generalized additive mixed modeling: A tutorial focusing on articulatory differences between L1 and L2 speakers of English
  • Jun 21, 2018
  • Journal of Phonetics
  • Martijn Wieling

  • Open Access Icon
  • PDF Download Icon
  • Cite Count Icon 49
  • 10.1186/s11689-015-9123-8
Speech motor planning and execution deficits in early childhood stuttering.
  • Aug 20, 2015
  • Journal of Neurodevelopmental Disorders
  • Bridget Walsh + 2 more

Similar Papers
  • Research Article
  • Cite Count Icon 19
  • 10.1017/s0952675711000297
A gestural account of a child-specific neutralisation in strong position
  • Dec 1, 2011
  • Phonology
  • Tara McAllister Byun

The child-specific phenomenon of preferential neutralisation in initial position, which reverses a positional bias attested across adult grammars, represents a long-standing problem for formal models of developmental phonology. In a phonetically based model of phonology, child-specific phonological patterns may emerge as the consequence of physical differences between child and adult speech. This paper presents new case-study data suggesting that a child-specific pattern of fricative neutralisation in initial position has its roots in children's articulatory limitations. Coarticulated fricative and vowel gestures are shown to require independent control of the tongue and jaw, known to be problematic for developing speakers. Substitution errors affecting fricatives are analysed as a phonologised reflex of this phonetic pressure to avoid overlapping vowel and fricative gestures. The positional asymmetry emerges as the consequence of the differing degrees of gestural overlap permitted in syllable-initialvs. syllable-final position, as encoded in the framework of Articulatory Phonology (Browman & Goldstein 1986).

  • Research Article
  • Cite Count Icon 1
  • 10.1121/1.415635
An acoustic evaluation of variation in the overlap of consonant and vowel gestures induced by speaking rate change.
  • Apr 1, 1996
  • The Journal of the Acoustical Society of America
  • Kris Tjaden + 1 more

There is a growing body of empirical data supporting a gestural‐based view of speech production in which adjacent speech gestures temporally and spatially overlap one another. Recent work suggests that certain acoustic measures are sensitive to changes in the overlap of adjacent consonant and vowel gestures [G. Weismer et al., J. Phon. 23, 111–126 (1995); E. Zsiga, J. Phon. 22, 121–140 (1994)]. Utilizing a graded speaking rate task, eight speakers produced 160 repetitions of ten target words embedded in a carrier phrase. F2 onset frequency, measured at the consonant–vowel boundary of test syllables, was used to index the degree of spectral‐temporal overlap of adjacent consonant and vowel gestures. Regression analyses were used to evaluate the extent to which F2 onset predicted temporal variability in F2 formant trajectories due to speaking rate change. Results suggest that consonant and vowel gestures do not simply temporally slide away from each other with slowed speaking rate. Rather, modifications in the form and magnitude of vowel gestures are required to account for the present acoustic data. An empirically based acoustic model of variation in gesture overlap induced by speaking rate change is offered. Individual speaker differences also are discussed.

  • Conference Article
  • 10.1121/2.0001691
Dynamic evidence for the vowel gesture retention of devoiced high vowels in Tokyo Japanese
  • Jan 1, 2022
  • Rion Iwasaki + 4 more

High vowels in Tokyo Japanese are typically devoiced between voiceless obstruents. Controversy remains over whether vowel gestures in devoiced vowels persist or are deleted. A previous ultrasound study (Iwasaki et al., 2020) showed that the lingual articulation of the /kV/ release differed by vowel context even when devoiced. This study looked at changes in the midsagittal tongue surface over time by quantifying the shape and location of the tongue within the vocal tract. Native speakers of Tokyo Japanese produced items (/kVC2e/) that contrasted in the voicing of V, which was either /i/ or /u/. Tongue contours were quantified by Discrete Fourier Transform (DFT), using the approach of Liljencrants (1971). Time series of phase and magnitude of the first Fourier coefficient were compared by vowel context (/i/ vs. /u) and vowel voicing (devoiced vs. voiced). These showed the presence of the vowel contrast whether vowels were devoiced or voiced, suggesting that some devoiced vowels retain lingual gestures. For /u/ only, we found a voicing-induced difference, consistent with reduction or deletion of this vowel gesture. DFT can be a useful tool to analyze tongue contours, supporting the analysis of tongue shape evolving over time.

  • Research Article
  • Cite Count Icon 249
  • 10.1159/000261823
Some notes on syllable structure in articulatory phonology.
  • Mar 1, 1988
  • Phonetica
  • Catherine P Browman + 1 more

Two approaches to seeking stable patterns in the gestural organization of speech are examined: local organization (individual gestures coordinated with other individual gestures) and global organization (gestures forming larger conglomerates). Articulatory evidence from American English words with a variety of initial consonants and clusters shows that syllable-initial consonants form a global organization (indexed by a metric we term the C-center) that is coordinated with the syllable's vowel gesture. For syllable-final consonants, however, the evidence suggests that a local organization is employed: The first postvocalic consonant gesture is coordinated with the vowel gesture. Implications of these different styles of organization for the perceptual and phonological structure of speech are discussed.

  • Dissertation
  • 10.31390/gradschool_disstheses.5407
Perceptual Evaluation of Infant Articulatory Transitions.
  • Jun 14, 2022
  • Nancye Roussel

The transition portion of the speech signal has been identified as critical to the perception of both consonants and vowels. Furui (1986) demonstrated that for adult Japanese speakers a 10 ms segment of the transition centered on the area of maximum spectral movement contained the most critical information for joint consonant and vowel perception. The distribution of phonemic cues contained within consonant-vowel (CV) non-reduplicated (NRB) syllables of infants ages 6 months to 15 months was compared to the distribution observed in adult productions to examine the extent of coproductive overlap of infant consonant and vowel gestures. Truncated versions of infant CV syllables in which initial and final portions of the syllable were systematically deleted were presented to adult listeners for syllable identification. Perceptual critical points (truncation point at which syllable identification fell below 60% for the first time) were determined for both initial and final truncation conditions, and critical intervals were calculated. Results revealed significant differences in the temporal distribution and strength of phonemic cues within the infant NRB CV syllable. First, perceptual critical points could only be determined in approximately 2/3 of the 60 NRB syllables included in the perceptual study, and fewer than 1/3 of the infant syllables demonstrated normal, adult-like identification functions under both initial and final truncation conditions. Obviously, the infant syllables did not manifest the level of cue redundancy or cue constancy seen in adult syllables. Second, infant critical intervals were on the average 2 1/2 times longer than those of adult Japanese speakers reflecting an overlap of the separate consonant and vowel gestures that is less compact than in adult syllables. Results also failed to identify a clear developmental progression towards either more adult-like critical interval durations or identification functions in the NRB syllables studied.

  • Research Article
  • Cite Count Icon 45
  • 10.1159/000261868
An Articulatory Study of Consonant-Induced Vowel Duration Changes in English
  • Jan 1, 1991
  • Phonetica
  • Kenneth De Jong

The effects of voicing, manner, and number of consonants on the duration of previous vowels in English can be created in three ways: by planned expansion or contraction of the entire vowel, by changing the relative timing of consonant and vowel gestures, or by modifying the duration of the closing movement for the consonants. An X-ray microbeam data base of 2 speakers of American English reciting monosyllabic words, which differed in their final consonants and in their position in the accentual structure of the utterance, was studied to evaluate these three accounts. In this data base, vowel duration differences due to the voicing of the following consonants were amplified by the presence of accent. Also, many of the duration differences associated with the following consonants are localized in the later portion of the opening movement, suggesting that subjects often create vowel duration changes by initiating the consonant closing gesture at an earlier or later time relative to the opening gesture.

  • Research Article
  • Cite Count Icon 3
  • 10.1111/j.1460-6984.1995.tb01687.x
Consonant‐vowel interactions in developmental phonological disorder
  • Oct 1, 1995
  • International Journal of Language & Communication Disorders
  • Sally Bates + 1 more

This paper presents examples of consonant‐vowel interactions collected from a group of Edinburgh children presenting with developmental phonological disorder. Three types of interaction are described: vowel conditioning of consonant production, consonant conditioning of vowel production and use of consonantal material to maintain vowel contrasts. Evidence of context conditioning in both consonant and vowel error patterns highlights the potential importance of assessing the child's sound system as a whole. It also raises the question of how far coarticulatory constraints play a role in phonological disorder. In this paper, the data are interpreted, where possible, in terms of production constraints and the degree of compatibility between consonant and vowel gestures. This approach represents one particular perspective and is not intended to preclude other perceptual and/or phonological accounts of the data.

  • Research Article
  • 10.1121/1.2026894
P centers, C centers, or what else?
  • May 1, 1989
  • The Journal of the Acoustical Society of America
  • Bernd Pompino-Marschall + 2 more

Some questions concerning the underlying nature of the so‐called P‐center phenomenon (the acoustic anisochrony of rhythmically regularly produced/perceived sequences of monosyllables) are discussed in the light of recent EMG data from experiments with rhythmically produced /Cak/ syllables (with /C/ = /p/, /pf/, /pl/, /pt/, /pfl/). The differences between /p/ vs /pf/ syllables resemble the timing differences in the microbeam data of Browman and Goldstein [Haskins SR‐93/94, 85–102 (1988)]. Whereas they found a stable timing relation between the “articulatory mean” of the initial consonants (termed C center) and the acoustical offset of the following vowel, in terms of their task dynamic description, however, the phase relations between consonant and vowel gestures should remain the same. This would result in a reduced stiffness of the vowel gesture for the syllables containing consonant clusters. The observations of the present study would also correspond to a reduction of stiffness for the vowel but the numerical data seem to contradict a stable phasing relation. In connection with the psychoacoustic interpretation of the P‐center phenomenon, an argument is made against a strictly intrinsic timing interpretation (based on articulatory regularities) of the effects of rhythmical production/perception.

  • Research Article
  • Cite Count Icon 2
  • 10.1121/1.417596
Differences in speaker’s articulatory space: Their contribution to vowel gesture and acoustic pattern
  • Oct 1, 1996
  • The Journal of the Acoustical Society of America
  • Kiyoshi Honda + 1 more

The view that individual speaker characteristics reflect morphological and functional differences in the speech apparatus has been accepted as self-evident despite the lack of convincing empirical data. A recent study on the issue of individual differences based on x-ray microbeam data from ten American English and ten Japanese speakers (five males and five females in each group) has pointed to a significance of morphology in accounting for speaker characteristics of vowel gesture. The midsagittal form of a speaker’s orofacial structure was parametrized by several landmarks on x-ray scan images. A few points and lines were chosen to form a quadrilateral framework so that they define a geometrical ‘‘articulatory space’’ where articulation of the tongue–jaw complex takes place. Pellet data for vowels were mapped onto speaker’s articulatory space to observe the effect of morphological difference on vowel articulation. Preliminary results have shown the extent of morphological variation of the speech apparatus and adaptability of vowel gesture to different forms of the space. The aspect ratio of the articulatory space was judged as the primary causal factor of articulatory variation. Acoustic manifestation and perceptual contribution of the variation will be discussed from a viewpoint of auditory-articulatory linkage.

  • Research Article
  • 10.1121/10.0016299
Retention of devoiced vowels in Tokyo Japanese: Evidence from lip articulation
  • Oct 1, 2022
  • The Journal of the Acoustical Society of America
  • Rion Iwasaki + 4 more

In Tokyo Japanese, high vowels /i/ and /u/ are frequently devoiced when they are surrounded by voiceless obstruents. Controversy remains over whether vowel gestures in devoiced vowels are retained or instead deleted. Both static (Iwasaki et al., 2020) and dynamic (Iwasaki et al., 2022) ultrasound data have indicated that vowel-specific lingual gestures can persist even when vowels are devoiced. This study focuses on the lip articulation of devoiced vowels by examining lateral lip aperture, where lower values index a greater degree of the rounding of vowels. Native speakers of Tokyo Japanese produced nonce word pairs with the form of /C1VC2V2toko/. V1 was either /i/ or /u/. C1 and C2 were either voiced or voiceless, which determined the voicing of V1. Lateral lip aperture during the first mora was calculated by identifying facial landmarks using OpenFace 2.0 (Baltrusaitis et al., 2018), and compared across vowel quality (/i/ vs./u/) and vowel voicing (devoiced vs. voiced). Preliminary results show that lateral lip aperture is larger for /i/ than for /u/ in both devoiced and voiced environments, indicating that these vowels maintain their labial specifications even when devoiced, providing additional evidence that devoiced vowels can retain their articulatory gestures.

  • Research Article
  • Cite Count Icon 57
  • 10.1121/1.423299
Dynamic specification of coarticulated German vowels: perceptual and acoustical studies.
  • Jul 1, 1998
  • The Journal of the Acoustical Society of America
  • Winifred Strange + 1 more

To examine the generality of Strange's Dynamic Specification Theory of vowel perception, two perceptual experiments investigated whether dynamic (time-varying) acoustic information about vowel gestures was critical for identification of coarticulated vowels in German, a language without diphthongization. The perception by native North German (NG) speakers of electronically modified /dVt/ syllables produced in carrier sentences was assessed using the "silent-center" paradigm. The relative efficacy of static target information, dynamic spectral information (defined over syllable onsets and offsets together), and intrinsic vowel length was investigated in listening conditions in which the centers (silent-center conditions) or the onsets and offsets (vowel-center conditions) of the syllables were silenced. Listeners correctly identified most vowels in silent-center syllables and in vowel-center stimuli when both conditions included information about intrinsic vowel length. When duration information was removed, errors increased significantly, but performance was relatively better for silent-center syllables than for vowel-center stimuli. Acoustical analyses of the effects of coarticulation on target formant frequencies, vocalic duration, and dynamic spectro-temporal patterns in the stimulus materials were performed to elucidate the nature of the dynamic spectral information. In comparison with vowels produced in citation from /hVt/ syllables by the same speaker, the coarticulated /dVt/ utterances showed considerable "target undershoot" of formant frequencies and reduced duration differences between tense and lax vowel pairs. This suggests that both static spectral cues and relative duration information for NG vowels may not remain perceptually distinctive in continuous speech. Analysis of formant movement within syllable nuclei corroborated descriptions of German vowels as monophthongal. However, an analysis of first formant temporal trajectories revealed distinct patterns for tense and lax vowels that could be used by listeners to disambiguate coarticulated NG vowels.

  • Research Article
  • 10.1121/10.0010666
An ultrasound study of high vowel devoicing in Tokyo Japanese: Evidence for the vowel gesture retention
  • Apr 1, 2022
  • The Journal of the Acoustical Society of America
  • Rion Iwasaki + 4 more

High vowels in Tokyo Japanese are typically devoiced between voiceless obstruents, but controversy remains over whether vowel gestures persist when devoiced or are instead deleted. A previous ultrasound study (Iwasaki et al., 2020) showed that the lingual articulation of the release burst of /kV/ differs by vowel context even when the vowels are devoiced. This study uses tongue surface contours derived from midsagittal ultrasound images to investigate the effects of vowel devoicing on changes in quantified tongue shape over time. Native speakers of Tokyo Japanese produced word pairs (/C1VC2e/) that contrasted in the voicing of V, which was either /i/ or /u/. Tongue shape was characterized by Fourier transforming tangent angles along each contour (Dawson et al., 2016). Time-normalized trajectories over the /C1VC2e/ sequence were compared by vowel context (/i/ versus /u/) and voicing environment (devoiced versus voiced). Preliminary results show that the real component of the first Fourier coefficient is sensitive todetecting evolving shape differences between the two vowel contexts over the sequence, not just when the vowels are voiced, but also when they are devoiced. Based on these results, the high vowel contrast persists even in the devoiced environment, suggesting that devoiced vowels are not deleted.

  • Research Article
  • 10.1121/1.407547
Perceptual evaluation of infant articulatory transitions
  • Sep 1, 1993
  • The Journal of the Acoustical Society of America
  • Nancye C Roussel + 1 more

The distribution of phonemic cues contained within consonant–vowel (CV) nonreduplicated (NRB) syllables of infants ages 6 to 15 months was compared to the distribution observed in adult productions [S. Furui, J. Acoust. Soc. Am. 57, 1016–1025 (1986)] to examine the extent of coproductive overlap of infant consonant and vowel gestures. Truncated versions of infant CV syllables were presented to adult listeners for syllable identification. Perceptual critical points (truncation point at which syllable identification fell below 60% for the first time) were determined for both initial and final truncation conditions, and critical intervals were calculated. Results revealed significant differences in the temporal distribution and strength of phonemic cues within the infant NRB CV syllable. Fewer than 1/3 of the infant syllables demonstrated normal, adult‐like identification functions under both initial and final truncation conditions, and infant critical intervals were on the average 21/2 times longer than those of adult Japanese speakers. Results did not reveal a clear developmental progression towards either more adult‐like critical interval durations or identification functions in the NRB syllables studied.

  • Research Article
  • 10.1121/1.415176
Gestural overlap analysis of lenis stop reduction in Korean.
  • Apr 1, 1996
  • The Journal of the Acoustical Society of America
  • Sook‐Hyang Lee

Korean lenis stops have been reported to become voiced intervocalically (Kagaya, 1974) and are often reduced to sonorants in casual speech with different degrees of reduction depending on the place of articulation (Lee, 1995). Velar lenis stops are more often reduced than the labials and coronals, which makes sense from the point of view of articulatory phonology (Browman and Goldstein, 1990). The oral gesture for a velar stop is on the same vocal tract tier (tongue body tier) as the neighboring vowels, so it should be more affected by overlap and blending with the vowel gestures than the closure gestures for other stops. This study investigates in what vowel environments lenis stop closure shows the most reduction. /VCV/ tokens in carrier sentences were recorded and acoustic analysis was done. The results showed that generally, reducing lenis stop closure is most in the environment of preceding and following low vowels, and least in the environment of high vowels, where the tongue body gestures of neighb...

  • Research Article
  • Cite Count Icon 7
  • 10.1121/1.2024063
Acoustic and articulatory evidence for consonant-vowel interactions
  • Dec 1, 1986
  • The Journal of the Acoustical Society of America
  • Carol Fowler + 3 more

Previous acoustical research [e.g., C. Fowler, Phonetica 38, 35–50 (1981)] has suggested that consonants and vowels are not produced strictly sequentially but rather overlap in time. By this account, the degree of co-production of underlying segments affects the pattern of the resulting acoustical durations. In the present experiment, two subjects produced utterances of the form CVC(C), where the vowels were either /ε/ or /æ/ and the following consonants were /p/, /k/, /s/, /ps/, /ks/, /sp/, or /sk/, Movements of the lips and jaw were monitored with a Selspot system. Laryngeal movements were measured by means of transillumination, and the accompanying acoustic signal was recorded. Analysis of the acoustic durations revealed that vowel duration was shorter when followed by a cluster than by a singleton in both vowel contexts. However, the following consonant durations were longer for the vowel /ε/ than for /æ/. Analyses of the lip and jaw kinematics suggested that the vowel shortening in the cluster contexts was produced in part by greater temporal overlap of the underlying segments. The influence of the intrinsic vowel duration on the following consonants may reflect a syllabic constraint on articulatory timing. The results will be discussed in terms of the articulatory structure of phonetic segments and the sequential coordination of consonant and vowel gestures. [Work supported by NINCDS.]

More from: Journal of Phonetics
  • Research Article
  • 10.1016/j.wocn.2025.101456
Gestural restructuring beyond coarticulation in Korean /w/-vowel sequences: Evidence from phonetic, dialectal, and gender variation
  • Nov 1, 2025
  • Journal of Phonetics
  • Dae-Yong Lee + 2 more

  • Research Article
  • 10.1016/j.wocn.2025.101457
Imitation of F0 tone contours by Mandarin and English speakers is both categorical and continuous
  • Nov 1, 2025
  • Journal of Phonetics
  • Wei Zhang + 2 more

  • Research Article
  • 10.1016/j.wocn.2025.101453
Transitory and sustained Cf0 effects: Evidence from Swiss German
  • Nov 1, 2025
  • Journal of Phonetics
  • Franka Zebe-Sheng + 3 more

  • Research Article
  • 10.1016/j.wocn.2025.101454
F0 derivatives in the classification of meaningful tonal movements
  • Nov 1, 2025
  • Journal of Phonetics
  • Constantijn Kaland

  • Research Article
  • 10.1016/j.wocn.2025.101432
The effect of rhythm on inter-gestural coupling of onset and vowel gestures and predictive timing in stuttering
  • Sep 1, 2025
  • Journal of Phonetics
  • Mona Franke + 3 more

  • Research Article
  • 10.1016/j.wocn.2025.101430
Velar palatalization, phonologization, and sound change – A comparative acoustic study of /k/-fronting in Majorcan Catalan
  • Sep 1, 2025
  • Journal of Phonetics
  • Miquel Simonet + 2 more

  • Research Article
  • 10.1016/j.wocn.2025.101431
A sociophonetic study of creaky voice across language, gender and age in Canadian English-French bilinguals
  • Sep 1, 2025
  • Journal of Phonetics
  • Jeanne Brown + 1 more

  • Research Article
  • 10.1016/j.wocn.2025.101433
Effects of sentential context on nonnative recognition of reduced speech: Does meaning explain it all?
  • Sep 1, 2025
  • Journal of Phonetics
  • Bihua Chen + 1 more

  • Research Article
  • 10.1016/j.wocn.2025.101434
Phonetic information in the vowel spectrum: the meaning of Mel-Frequency Cepstral Coefficients
  • Sep 1, 2025
  • Journal of Phonetics
  • Khalil Iskarous + 1 more

  • Research Article
  • 10.1016/j.wocn.2025.101429
Perceived cross-linguistic similarity of retroflexes in trilingual, bilingual and native listener groups
  • Sep 1, 2025
  • Journal of Phonetics
  • Anna Balas + 3 more

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon

AI summaries and top papers from 250M+ research sources.

Search IconWhat is the difference between bacteria and viruses?
Open In New Tab Icon
Search IconWhat is the function of the immune system?
Open In New Tab Icon
Search IconCan diabetes be passed down from one generation to the next?
Open In New Tab Icon