Neurophysiological Indices of Audiovisual Speech Processing Reveal a Hierarchy of Multisensory Integration Effects.

Aisling E O'Sullivan,Michael J Crosse,Giovanni M Di Liberto,Edmund C Lalor,Alain De Cheveigné

doi:10.1523/jneurosci.0906-20.2021

Abstract

Seeing a speaker's face benefits speech comprehension, especially in challenging listening conditions. This perceptual benefit is thought to stem from the neural integration of visual and auditory speech at multiple stages of processing, whereby movement of a speaker's face provides temporal cues to auditory cortex, and articulatory information from the speaker's mouth can aid recognizing specific linguistic units (e.g., phonemes, syllables). However, it remains unclear how the integration of these cues varies as a function of listening conditions. Here, we sought to provide insight on these questions by examining EEG responses in humans (males and females) to natural audiovisual (AV), audio, and visual speech in quiet and in noise. We represented our speech stimuli in terms of their spectrograms and their phonetic features and then quantified the strength of the encoding of those features in the EEG using canonical correlation analysis (CCA). The encoding of both spectrotemporal and phonetic features was shown to be more robust in AV speech responses than what would have been expected from the summation of the audio and visual speech responses, suggesting that multisensory integration occurs at both spectrotemporal and phonetic stages of speech processing. We also found evidence to suggest that the integration effects may change with listening conditions; however, this was an exploratory analysis and future work will be required to examine this effect using a within-subject design. These findings demonstrate that integration of audio and visual speech occurs at multiple stages along the speech processing hierarchy.SIGNIFICANCE STATEMENT During conversation, visual cues impact our perception of speech. Integration of auditory and visual speech is thought to occur at multiple stages of speech processing and vary flexibly depending on the listening conditions. Here, we examine audiovisual (AV) integration at two stages of speech processing using the speech spectrogram and a phonetic representation, and test how AV integration adapts to degraded listening conditions. We find significant integration at both of these stages regardless of listening conditions. These findings reveal neural indices of multisensory interactions at different stages of processing and provide support for the multistage integration framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neurophysiological Indices of Audiovisual Speech Processing Reveal a Hierarchy of Multisensory Integration Effects.

Abstract

Talk to us

Similar Papers

More From: The Journal of Neuroscience

Lead the way for us

Journal: The Journal of Neuroscience	Publication Date: Apr 6, 2021
Citations: 37

Similar Papers

Audiovisual Speech Enhancement via Cross-Modal Suppression of Auditory Association Cortex by Visual Speech
Patrick J Karas ... Brian A Metzger
Neurosurgery | VOL. 66
Patrick J Karas, et. al.Patrick J Karas ... Brian A Metzger
20 Aug 2019
Neurosurgery | VOL. 66

Eye Can Hear Clearly Now: Inverse Effectiveness in Natural Audiovisual Speech Processing Relies on Long-Term Crossmodal Temporal Integration.
Michael J Crosse ... Edmund C Lalor
The Journal of Neuroscience | VOL. 36
Michael J Crosse, et. al.Michael J Crosse ... Edmund C Lalor
21 Sep 2016
The Journal of Neuroscience | VOL. 36

Differential Auditory and Visual Phase-Locking Are Observed during Audio-Visual Benefit and Silent Lip-Reading for Speech Perception.
Máté Aller ... Matthew H Davis
The Journal of neuroscience : the official journal of the Society for Neuroscience | VOL. 42
Máté Aller, et. al.Máté Aller ... Matthew H Davis
27 Jun 2022
The Journal of neuroscience : the official journal of the Society for Neuroscience | VOL. 42

Phonetic recalibration of visual and auditory speech by visual sentences
Josh Dorsi ... Sharon Chee
The Journal of the Acoustical Society of America | VOL. 144
Josh Dorsi, et. al.Josh Dorsi ... Sharon Chee
01 Sep 2018
The Journal of the Acoustical Society of America | VOL. 144

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neurophysiological Indices of Audiovisual Speech Processing Reveal a Hierarchy of Multisensory Integration Effects.

Abstract

Talk to us

Similar Papers

More From: The Journal of Neuroscience