Subject-Agnostic Transformer-Based Neural Speech Decoding from Surface and Depth Electrode Signals.

Junbo Chen,Yao Wang,Adeen Flinker,Erika Jensen,Ran Wang,Chenqian Le,Orrin Devinsky,Daniel Friedman,Werner Doyle,Patricia Dugan,Xupeng Chen,Amirhossein Khalilian-Gourtani

doi:10.1101/2024.03.11.584533

Abstract

This study investigates speech decoding from neural signals captured by intracranial electrodes. Most prior works can only work with electrodes on a 2D grid (i.e., Electrocorticographic or ECoG array) and data from a single patient. We aim to design a deep-learning model architecture that can accommodate both surface (ECoG) and depth (stereotactic EEG or sEEG) electrodes. The architecture should allow training on data from multiple participants with large variability in electrode placements and the trained model should perform well on participants unseen during training. We propose a novel transformer-based model architecture named SwinTW that can work with arbitrarily positioned electrodes by leveraging their 3D locations on the cortex rather than their positions on a 2D grid. We train subject-specific models using data from a single participant and multi-patient models exploiting data from multiple participants. The subject-specific models using only low-density 8×8 ECoG data achieved high decoding Pearson Correlation Coefficient with ground truth spectrogram (PCC=0.817), over N=43 participants, outperforming our prior convolutional ResNet model and the 3D Swin transformer model. Incorporating additional strip, depth, and grid electrodes available in each participant (N=39) led to further improvement (PCC=0.838). For participants with only sEEG electrodes (N=9), subject-specific models still enjoy comparable performance with an average PCC=0.798. The multi-subject models achieved high performance on unseen participants, with an average PCC=0.765 in leave-one-out cross-validation. The proposed SwinTW decoder enables future speech neuropros-theses to utilize any electrode placement that is clinically optimal or feasible for a particular participant, including using only depth electrodes, which are more routinely implanted in chronic neurosurgical procedures. Importantly, the generalizability of the multi-patient models suggests that such a model can be applied to new patients that do not have paired acoustic and neural data, providing an advance in neuroprostheses for people with speech disability, where acoustic-neural training data is not feasible.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: bioRxiv : the preprint server for biology	Publication Date: Sep 25, 2024
Citations: 1	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

Subject-Agnostic Transformer-Based Neural Speech Decoding from Surface and Depth Electrode Signals.

Abstract

Talk to us

Similar Papers

More From: bioRxiv : the preprint server for biology

Lead the way for us

Similar Papers

Probabilistic comparison of gray and white matter coverage between depth and surface intracranial electrodes in epilepsy
Daria Nesterovich Anderson ... Tyler S Davis
Scientific Reports | VOL. 11
Daria Nesterovich Anderson, et. al.Daria Nesterovich Anderson ... Tyler S Davis
01 Dec 2021
Scientific Reports | VOL. 11

Simultaneous subdural grid and depth electrodes in patients with refractory complex partial seizures
Elizabeth Barry ... Allan Krumholz
Journal of Epilepsy | VOL. 5
Elizabeth Barry, et. al.Elizabeth Barry ... Allan Krumholz
01 Jan 1992
Journal of Epilepsy | VOL. 5

Simultaneous Frame-assisted Stereotactic Placement of Subdural Grid Electrodes and Intracerebral Depth Electrodes.
Daniel Delev ... Marie T Krüger
Journal of Neurological Surgery Part A: Central European Neurosurgery | VOL. 80
Daniel Delev, et. al.Daniel Delev ... Marie T Krüger
13 May 2019
Journal of Neurological Surgery Part A: Central European Neurosurgery | VOL. 80

Intraoperative computed tomography for intracranial electrode implantation surgery in medically refractory epilepsy.
Darrin J Lee ... Masud Seyal
Journal of neurosurgery | VOL. 122
Darrin J Lee, et. al.Darrin J Lee ... Masud Seyal
31 Oct 2014
Journal of neurosurgery | VOL. 122

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Subject-Agnostic Transformer-Based Neural Speech Decoding from Surface and Depth Electrode Signals.

Abstract

Talk to us

Similar Papers

More From: bioRxiv : the preprint server for biology