Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.

Maansi Desai,Nat Clark,Cassandra Villarreal,Liberty S Hamilton,Jade Holder,Brittany Hoang

doi:10.1523/jneurosci.2891-20.2021

Abstract

In natural conversations, listeners must attend to what others are saying while ignoring extraneous background sounds. Recent studies have used encoding models to predict electroencephalography (EEG) responses to speech in noise-free listening situations, sometimes referred to as "speech tracking." Researchers have analyzed how speech tracking changes with different types of background noise. It is unclear, however, whether neural responses from acoustically rich, naturalistic environments with and without background noise can be generalized to more controlled stimuli. If encoding models for acoustically rich, naturalistic stimuli are generalizable to other tasks, this could aid in data collection from populations of individuals who may not tolerate listening to more controlled and less engaging stimuli for long periods of time. We recorded noninvasive scalp EEG while 17 human participants (8 male/9 female) listened to speech without noise and audiovisual speech stimuli containing overlapping speakers and background sounds. We fit multivariate temporal receptive field encoding models to predict EEG responses to pitch, the acoustic envelope, phonological features, and visual cues in both stimulus conditions. Our results suggested that neural responses to naturalistic stimuli were generalizable to more controlled datasets. EEG responses to speech in isolation were predicted accurately using phonological features alone, while responses to speech in a rich acoustic background were more accurate when including both phonological and acoustic features. Our findings suggest that naturalistic audiovisual stimuli can be used to measure receptive fields that are comparable and generalizable to more controlled audio-only stimuli.SIGNIFICANCE STATEMENT Understanding spoken language in natural environments requires listeners to parse acoustic and linguistic information in the presence of other distracting stimuli. However, most studies of auditory processing rely on highly controlled stimuli with no background noise, or with background noise inserted at specific times. Here, we compare models where EEG data are predicted based on a combination of acoustic, phonetic, and visual features in highly disparate stimuli-sentences from a speech corpus and speech embedded within movie trailers. We show that modeling neural responses to highly noisy, audiovisual movies can uncover tuning for acoustic and phonetic information that generalizes to simpler stimuli typically used in sensory neuroscience experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Journal of neuroscience : the official journal of the Society for Neuroscience	Publication Date: Sep 9, 2021
Citations: 17	License type: CC BY-NC-SA 4.0

R Discovery Prime

R Discovery Prime

Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.

Abstract

Talk to us

Similar Papers

More From: The Journal of neuroscience : the official journal of the Society for Neuroscience

Lead the way for us

Similar Papers

Exploiting complementary aspects of phonological features in automatic speech recognition
Parya Momayyez ... James Waterhouse
-
Parya Momayyez, et. al.Parya Momayyez ... James Waterhouse
01 Jan 2007
01 Jan 2007

Significance of Phonological Features in Speech Emotion Recognition
Wei Wang ... Lingjie Shen
International Journal of Speech Technology | VOL. 23
Wei Wang, et. al.Wei Wang ... Lingjie Shen
15 Jul 2020
International Journal of Speech Technology | VOL. 23

A tradeoff between acoustic and linguistic feature encoding in spoken language comprehension.
Filiz Tezcan ... Hugo Weissbart
eLife | VOL. 12
Filiz Tezcan, et. al.Filiz Tezcan ... Hugo Weissbart
07 Jul 2023
eLife | VOL. 12

Identifying Core Affect in Individuals from fMRI Responses to Dynamic Naturalistic Audiovisual Stimuli.
Jongwan Kim ... Douglas H Wedell
PLOS ONE | VOL. 11
Jongwan Kim, et. al.Jongwan Kim ... Douglas H Wedell
06 Sep 2016
PLOS ONE | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.

Abstract

Talk to us

Similar Papers

More From: The Journal of neuroscience : the official journal of the Society for Neuroscience