Chunking or Not Chunking? How Do We Find Words in Artificial Language Learning?

Ana Franco (afranco@ulb.ac.be)
Arnaud Destrebecqz (adestre@ulb.ac.be)
Cognition, Consciousness and Computation Group, Université libre de Bruxelles, 50 ave. F.-D. Roosevelt, B1050 Belgium

Abstract

What is the nature of the representations acquired in implicit statistical learning? Recent results in the field of language learning have shown that adults and infants are able to find the words of an artificial language when exposed to a continuous auditory sequence consisting of a random ordering of these words. Such performance can only be based on processing the transitional probabilities between sequence elements. Two different kinds of mechanisms may account for these data: participants either parse the sequence into smaller chunks corresponding to the words of the artificial language, or they become progressively sensitive to the actual values of the transitional probabilities. The two accounts are difficult to differentiate because they tend to make similar predictions in similar experimental settings. In this study, we present two experiments aimed at disentangling these two theories. In these experiments, participants had to learn two sets of pseudo-linguistic regularities (L1 and L2) presented in the context of a Serial Reaction Time (SRT) task. L1 and L2 were either unrelated, or the intra-word transitions of L1 became the inter-word transitions of L2. The two models make opposite predictions in these two situations. Our results indicate that the nature of the representations depends on the learning conditions. When cues were presented to facilitate parsing of the sequence, participants learned the words of the artificial language. However, when no cues were provided, their performance was strongly influenced by the actual values of the transitional probabilities.

Keywords: implicit statistical learning; SRN; chunking; serial reaction time task

Introduction

A central issue in implicit learning research concerns the nature of the acquired knowledge. Does it reflect the abstract rules on which the training material is based, or the surface features of the material, such as the frequencies of individual elements or chunks? According to some theorists, cognition can be viewed as rule-based symbol manipulation (Pinker & Prince, 1988). From this perspective, learning would consist in the formation of new abstract, algebra-like rules. According to another theoretical position, information processing is essentially based on associative processes. In this view, learning would not depend on rule acquisition but on mechanisms capable of extracting the statistical regularities present in the environment (e.g., Elman, 1990).

Over the last few years, a series of experimental results has provided new insights into the nature of the representations involved in implicit learning. Research on language acquisition has shown that 8-month-old infants are sensitive to statistical information (Jusczyk et al., 1994; Saffran, Aslin, & Newport, 1996; Saffran, Johnson, Aslin, & Newport, 1999) and capable of learning distributional relationships between linguistic units (Gomez & Gerken, 1999; Jusczyk, Houston, & Newsome, 1999; Saffran, Aslin, & Newport, 1996; Perruchet & Desaulty, 2008) presented in the continuous speech stream formed by an artificial language.
Other studies have indicated that adults are also capable of extracting statistical regularities, and that these mechanisms are not restricted to linguistic material but also apply to auditory non-linguistic stimuli (Saffran, Johnson, Aslin, & Newport, 1999) or to visual stimuli (Fiser & Aslin, 2002). In the same way, implicit sequence learning studies have indicated that human learners are good at detecting the statistical regularities present in a serial reaction time (SRT) task. Altogether, these data suggest that statistical learning depends on associative learning mechanisms rather than on the existence of a “rule abstractor device” (Perruchet, Tyler, Galland, & Peereman, 2004).

However, different models have been proposed to account for the data. According to the Simple Recurrent Network (SRN) model (Elman, 1990; Cleeremans & McClelland, 1991; Cleeremans, 1993), learning is based on the development of associations between the temporal context in which the successive elements occur and their possible successors. Over training, the network learns to provide the best prediction of the next target in a given context, based on the transitional probabilities between the different sequence elements. Chunking models such as PARSER, on the other hand, view learning as an attention-based parsing process that results in the formation of distinctive, unitary, rigid representations, or chunks (Perruchet & Vinter, 1998). Thus, both models are based on processing statistical regularities, but only PARSER leads to the formation of “word-like” units.

Although the representations assumed by these two classes of models are quite different, contrasting their assumptions is made difficult by the fact that they tend to make similar experimental predictions. For instance, in a typical artificial language learning experiment, participants are exposed to a continuous stream of plurisyllabic non-words (e.g., BATUBI, DUTABA…) presented in a random order, such that transitional probabilities between syllables are stronger intra-word than inter-word.
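To make this statistical structure concrete, the following minimal sketch (not taken from the paper; the four-word lexicon with non-overlapping syllables is a hypothetical assumption chosen for clarity) estimates transitional probabilities from a continuous stream obtained by randomly concatenating artificial words:

```python
# Illustrative sketch: estimating transitional probabilities between syllables
# from a continuous stream built out of artificial words. The lexicon below is
# hypothetical; its syllables do not overlap across words, so intra-word and
# inter-word transitions are easy to contrast.
import random
from collections import Counter

words = ["bidaku", "padoti", "golabu", "tupiro"]      # hypothetical lexicon
syllabify = lambda w: [w[i:i + 2] for i in range(0, len(w), 2)]

stream = []                                           # continuous stream,
for _ in range(500):                                  # no pauses between words
    stream.extend(syllabify(random.choice(words)))

pair_counts = Counter(zip(stream, stream[1:]))        # syllable bigram counts
first_counts = Counter(stream[:-1])                   # counts of the first element
tp = {(a, b): n / first_counts[a] for (a, b), n in pair_counts.items()}

# Intra-word transitions (e.g., "bi" -> "da") approach 1.0, whereas inter-word
# transitions (e.g., "ku" -> "pa") hover around 1 / number_of_words, because
# word order in the stream is random.
print(tp[("bi", "da")], tp.get(("ku", "pa"), 0.0))
```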
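For completeness, here is a minimal sketch of what an Elman-style SRN of the kind described above could look like. The one-hot syllable coding, layer sizes, and learning rate are illustrative assumptions, not the authors' implementation; as in Elman (1990), gradients are truncated at the copied context layer.

```python
# Minimal Elman-style SRN sketch (illustrative assumptions: one-hot syllable
# coding, layer sizes, learning rate; not the authors' implementation).
import random
import numpy as np

rng = np.random.default_rng(0)

class SRN:
    def __init__(self, n_items, n_hidden=20, lr=0.1):
        self.W_in = rng.normal(0.0, 0.1, (n_hidden, n_items))
        self.W_ctx = rng.normal(0.0, 0.1, (n_hidden, n_hidden))
        self.W_out = rng.normal(0.0, 0.1, (n_items, n_hidden))
        self.context = np.zeros(n_hidden)   # copy of the previous hidden state
        self.lr = lr

    def step(self, x, target):
        # Forward pass: the hidden state combines the current input with the
        # copied context; the output is a prediction of the next element.
        h = np.tanh(self.W_in @ x + self.W_ctx @ self.context)
        logits = self.W_out @ h
        p = np.exp(logits - logits.max())
        p /= p.sum()
        # Backward pass, truncated: no gradient flows into the old context.
        d_logits = p - target                          # softmax + cross-entropy
        d_h = (self.W_out.T @ d_logits) * (1.0 - h ** 2)
        self.W_out -= self.lr * np.outer(d_logits, h)
        self.W_in -= self.lr * np.outer(d_h, x)
        self.W_ctx -= self.lr * np.outer(d_h, self.context)
        self.context = h                               # becomes the next context
        return p

# Rebuild a stream as in the previous sketch and train on it, one element at a
# time, always predicting the next syllable.
words = ["bidaku", "padoti", "golabu", "tupiro"]       # same hypothetical lexicon
syllabify = lambda w: [w[i:i + 2] for i in range(0, len(w), 2)]
stream = [s for _ in range(500) for s in syllabify(random.choice(words))]

syllables = sorted(set(stream))
onehot = np.eye(len(syllables))
net = SRN(len(syllables))
for prev, nxt in zip(stream, stream[1:]):
    net.step(onehot[syllables.index(prev)], onehot[syllables.index(nxt)])
```

On this reading, the network's output activations come to approximate the transitional probabilities between syllables, so it predicts the stream well without ever storing word units, whereas a chunking model such as PARSER would instead represent the words themselves as units.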