Abstract

Neural oscillations constitute an intrinsic property of functional brain organization that facilitates the tracking of linguistic units at multiple time scales through brain-to-stimulus alignment. This ubiquitous neural principle has been shown to facilitate speech segmentation and word learning based on statistical regularities. However, there is no common agreement yet on whether speech segmentation is mediated by a transition of neural synchronization from syllable to word rate, or whether the two time scales are concurrently tracked. Furthermore, it is currently unknown whether syllable transition probability contributes to speech segmentation when lexical stress cues can be directly used to extract word forms. Using Inter-Trial Coherence (ITC) analyses in combinations with Event-Related Potentials (ERPs), we showed that speech segmentation based on both statistical regularities and lexical stress cues was accompanied by concurrent neural synchronization to syllables and words. In particular, ITC at the word rate was generally higher in structured compared to random sequences, and this effect was particularly pronounced in the flat condition. Furthermore, ITC at the syllable rate dynamically increased across the blocks of the flat condition, whereas a similar modulation was not observed in the stressed condition. Notably, in the flat condition ITC at both time scales correlated with each other, and changes in neural synchronization were accompanied by a rapid reconfiguration of the P200 and N400 components with a close relationship between ITC and ERPs. These results highlight distinct computational principles governing neural synchronization to pertinent linguistic units while segmenting speech under different listening conditions.

Highlights

  • Speech is a hierarchically organized acoustic signal composed of linguistic units at different time scales, such as phonemes, syllables and words (Ding et al, 2016)

  • Using inter-trial coherence (ITC) analyses in combinations with Event-Related Potentials (ERPs), we showed that speech segmentation based on both statistical regularities and lexical stress cues was accompanied by concurrent neural synchronization to syllables and words

  • Statistical learning is reflected by an N400-like event-related potential (ERP) with main generators in the posterior supratemporal plane and the ventral premotor cortex (Cunillera et al, 2009), whereas speech segmentation through pitch-based stress differences between syllables has been associated with the P200 ERP component which could be localized in the auditory cortex (Cunillera et al, 2006)

Read more

Summary

Introduction

Speech is a hierarchically organized acoustic signal composed of linguistic units at different time scales, such as phonemes, syllables and words (Ding et al, 2016). Previous behavioral studies have demonstrated that both statistical learning (Mattys et al, 2005) and prosodic cues (Johnson and Jusczyk, 2001; Thiessen and Saffran, 2003) can be used to segment speech and extract word forms from continuous acoustic signals. Statistical learning is reflected by an N400-like event-related potential (ERP) with main generators in the posterior supratemporal plane and the ventral premotor cortex (Cunillera et al, 2009), whereas speech segmentation through pitch-based stress differences between syllables (prosodic bootstrapping) has been associated with the P200 ERP component which could be localized in the auditory cortex (Cunillera et al, 2006)

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.