Decoding speech perception from non-invasive brain recordings

Alexandre Défossez,Charlotte Caucheteux,Jérémy Rapin,Ori Kabeli,Jean-Rémi King

doi:10.1038/s42256-023-00714-5

Abstract

Decoding speech from brain activity is a long-awaited goal in both healthcare and neuroscience. Invasive devices have recently led to major milestones in this regard: deep-learning algorithms trained on intracranial recordings can now start to decode elementary linguistic features such as letters, words and audio-spectrograms. However, extending this approach to natural speech and non-invasive brain recordings remains a major challenge. Here we introduce a model trained with contrastive learning to decode self-supervised representations of perceived speech from the non-invasive recordings of a large cohort of healthy individuals. To evaluate this approach, we curate and integrate four public datasets, encompassing 175 volunteers recorded with magneto-encephalography or electro-encephalography while they listened to short stories and isolated sentences. The results show that our model can identify, from 3 seconds of magneto-encephalography signals, the corresponding speech segment with up to 41% accuracy out of more than 1,000 distinct possibilities on average across participants, and with up to 80% in the best participants—a performance that allows the decoding of words and phrases absent from the training set. The comparison of our model with a variety of baselines highlights the importance of a contrastive objective, pretrained representations of speech and a common convolutional architecture simultaneously trained across multiple participants. Finally, the analysis of the decoder’s predictions suggests that they primarily depend on lexical and contextual semantic representations. Overall, this effective decoding of perceived speech from non-invasive recordings delineates a promising path to decode language from brain activity, without putting patients at risk of brain surgery.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature Machine Intelligence	Publication Date: Oct 1, 2023
Citations: 34	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Decoding speech perception from non-invasive brain recordings

Abstract

Talk to us

Similar Papers

More From: Nature Machine Intelligence

Lead the way for us

Similar Papers

Simultaneous invasive and non-invasive recordings in humans: A novel Rosetta stone for deciphering brain activity
Andrea Pigorini ... Olivier David
Journal of Neuroscience Methods | VOL. 408
Andrea Pigorini, et. al.Andrea Pigorini ... Olivier David
09 May 2024
Journal of Neuroscience Methods | VOL. 408

Characteristics of Waveform Shape in Parkinson's Disease Detected with Scalp Electroencephalography.
Nicko Jackson ... Nicole C Swann
eneuro | VOL. 6
Nicko Jackson, et. al.Nicko Jackson ... Nicole C Swann
01 May 2019
eneuro | VOL. 6

Secondary predication and the lexical representation of verbs
T R Rapoport
Machine Translation | VOL. 5
T R RapoportT R Rapoport
01 Mar 1990
Machine Translation | VOL. 5

The D1152H cystic fibrosis mutation in prenatal carrier screening, patients and prenatal diagnosis
Leah Peleg ... Hagith Yonath
Journal of Medical Screening | VOL. 18
Leah Peleg, et. al.Leah Peleg ... Hagith Yonath
01 Dec 2011
The D1152H cystic fibrosis mutation in prenatal carrier screening, patients and prenatal diagnosis
Leah Peleg ... Hagith Yonath

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decoding speech perception from non-invasive brain recordings

Abstract

Talk to us

Similar Papers

More From: Nature Machine Intelligence